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Preface 


This “book” is based on lectures I gave at TASI during the summer of 2013. This 
document is not intended as a reference work. It is certainly not encyclopedic. It 
is not even complete. This is because these are lectures for graduate students, and 
I aimed at pedagogy. So don’t look here for a complete list of topics, nor for a 
complete set of references. My hope is that a physics student who has taken some 
courses on Quantum Field Theory and has been exposed to the Standard Model 
of electroweak interactions and has heard of pions and B mesons will be able to 
learn a lot of flavor physics. 

The experienced reader may disagree with my choices. Heck, I may disagree 
with my choice if I teach this again. In spite of years of experience, I almost always 
find in retrospect that the approaches opted for on teaching a course for the first 
time are far from optimal. The course is just a crude approximation to one that is 
an evolving project. If only I got to teach this a few more times, I would get really 
good at it. 

But my institution, and most institutions in the USA can’t afford to spend 
Professors’ time teaching various advanced and technical courses; the demand is 
largest for large undergraduate service courses, for pre-meds and engineering ma¬ 
jors, where the Distinguished Professor’s unique expertise is, frankly, irrelevant and 
useless. Still, it is what pays the bills. 

That’s where TASI comes in to fill a tremendous need (fill a hunger, would 
be even more appropriate) of the students of theoretical particle physics. I feel 
privileged and honored that I have been given the opportunity to present these 
lectures on Flavor Physics and hope that the writeup of these lectures can be of 
use to many current and future students that may not have the good fortune of 
attending a TASI. 

Being lectures, there are lots of exercises that go with these. The exercises are 
not collected at the back, not even at then end of each chapter or section. They are 
interspersed in the material. The problems tend to expand or check on one point 
and I think it’s best for a student to solve the exercises in context. I have many 
ideas for additional exercises, but only limited time. I hope to add some more in 
time. Some day I will publish the solutions. Some are already typed into the TeX 



V 


source and I hope to keep adding to it. You should be able to find the typeset 
solutions as an ancillary file in the arXiv submission. 

No one is perfect and I am certainly far from it. I would appreciate alert 
readers to send me any typos, errors or any other needed corrections they may 
find. Suggestions for any kind of improvement are welcome. I will be indebted if 
you’d send them to me at bgrinstein@ucsd.edu 

Benjamin Grinstein 
San Diego, June 2014 



Chapter 1 


Flavor Theory 


1.1 Introduction: What/Why/How? 

WHAT : There are six different types of quarks: u (“up ”), d { “down”), s (“strange”), 
c (“charm”), b (“bottom”) and t (“top”). Flavor physics is the study of different 
types of quarks, or “flavors,” their spectrum and the transmutations among them. 
More generally different types of leptons, “lepton flavors,” can also be included 
in this topic, but in this lectures we concentrate on quarks and the hadrons that 
contain them. 


WHY: Flavor physics is very rich. You should have a copy of the PDG, or at 
least a bookmark to pdg.lbl.gov on your computer. A quick inspection of the PDG 
reveals that a great majority of content gives transition rates among hadrons with 
different quark content, mostly decay rates. This is all flavor physics. We aim at 
understanding this wealth of information in terms of some simple basic principles. 
That we may be able to do this is striking endorsement of the validity of our 
theoretical model of nature, and gives stringent constraints on any new model 
of nature you may invent. Indeed, many models you may have heard about, in 
fact many of the most popular models, like gauge mediated SUSY breaking and 
extended technicolor, were invented to address the strong constraints imposed by 
flavor physics. Moreover, all observed CP violation (CPV) in nature is tied to 
flavor changing interactions, so understanding of this fundamental phenomenon is 
the domain of flavor physics. 


HOW: The richness of flavor physics comes at a price: while flavor transitions 
occur intrinsically at the quark level, we only observe transitions among hadrons. 
Since quarks are bound in hadrons by the strong interactions we face the prob¬ 
lem of confronting theory with experiment in the context of mathematical models 
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that are not immediately amenable to simple analysis, like perturbation theory. 
Moreover, the physics of flavor more often than not involves several disparate time 
(or energy) scales, making even dimensional analysis somewhere between difficult 
and worthless. Many tools have been developed to address these issues, and these 
lectures will cover many of them. Among these: 

• Symmetries allow us to relate different processes and sometimes even to pre¬ 
dict the absolute rate of a transition. 

• Effective Field Theory (EFT) allows to systematically disentangle the effects 
of disparate scales. Fermi theory is an EFT for electroweak interactions at 
low energies. Chiral Lagrangians encapsulate the information of symmetry 
relations of transitions among pseudo-Goldstone bosons. Heavy Quark Ef¬ 
fective Theory (HQET) disentangles the scales associated with the masses 
of heavy quarks from the scale associated with hadron dynamics and makes 
explicit spin and heavy-flavor symmetries. And so on. 

• Monte-Carlo simulations of strongly interacting quantum field theories on the 
lattice can be used to compute some quantities of basic interest that cannot 
be computed using perturbation theory. 

1.2 Flavor in the Standard Model 

Since the Standard Model of Strong and Electroweak interactions (SM) works so 
well, we will adopt it as our standard (no pun intended) paradigm. All alternative 
theories that are presently studied build on the SM; we refer to them collectively 
as Beyond the SM (BSM). Basing our discussion on the SM is very useful: 

• It will allow us to introduce concretely the methods used to think about and 
quantitatively analyze Flavor physics. It should be straightforward to extend 
the techniques introduced in the context of the SM to specific BSM models. 

• Only to the extent that we can make precise calculations in the SM and con¬ 
front them with comparably precise experimental results can we meaningfully 
study effects of other (BSM) models. 

So let’s review the SM. At the very least, this allows us to agree on notation. 
The SM is a gauge theory, with gauge group 517(3) x SU{2) x 17(1). The 517(3) 
factor models the strong interactions of “colored” quarks and gluons, 517(2) x 
17(1) is the famous Glashow-Weinberg-Salam model of the electroweak interactions. 
Sometimes we will refer to these as 517(3)c and 517(2)w x 17(1 )y to distinguish 
them from other transformations with the same groups. The matter content of the 
model consists of color triplet quarks: left handed spinor doublets with 17(1) 
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“hypercharge” Y = 1/6 and right handed spinor singlets and with Y = 2/11 
and Y = —1/3. The color {SU{3)), weak (5?7(2)), and Lorentz-transformation 
indices are implicit. The “i” index runs over i = 1, 2, 3 accounting for three copies, 
or “generations.” A more concise description is q\ = (3,2)i/g, meaning that q\ 
transforms as a 3 under SU{3), a 2 under SU{2) and has Y = 1/6 (the U{1) 
charge). Similarly, = (3, 1 ) 2/3 = (3, l)_i/ 3 . The leptons are color 

singlets: = (l, 2 )_i /2 and = (l,l)-i. 

We give names to the quarks in different generations; 

^ ((di) ’ (si) ’ (fei)) ’ ^R = (urArAr), dl‘R = {dR,SR,bR). 

( 1 . 1 ) 

Note that we have used the same symbols, “u” and “d,” to denote the collection of 
quarks in a generation and the individual elements in the first generation. When 
the superscript i is explicit this should give rise to no confusion. But soon we will 
want to drop the superscript to denote collectively the generations as vectors q^, 
Ur and d/j, and then we will have to rely on the context to figure out whether it 
is the collection or the individual first element that we are referring to. For this 
reason some authors use the capital letters Ur and Dr to denote the vectors in 
generation space. But I want to reserve U for unitary transformations, and I think 
you should have no problem figuring out what we are talking about from context. 

Similarly, for leptons we have 

The last ingredient of the SM is the Brout-Englert-Higgs (BEH) field, H, a 
collection of complex scalars transforming as (1, 2 ) 3 / 2 . The BEH field has an ex¬ 
pectation value, which we take to be 

<"> = 

The hermitian conjugate held H = ia^H* transforms as (l, 2 )_ 3/2 and is useful 
in constructing Yukawa interactions invariant under the electroweak group. The 
covariant derivative is 

Df, = df, + igsT^Al + + igiYB^. (1.4) 

Here we have used already the Pauli u* matrices as generators of SU{2), since 
the only helds which are non-singlets under this group are all doublets (and, of 
course, one should replace zero for above in the case of singlets). It should 
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also be clear that we are using the generalized Einstein convention: the repeated 
index a is summed over a = 1,..., — 1, where Nc = 3 is the number of colors, 

and j is summed over j = 1,2,3. The generators T“ of SU{3) are normalized so 
that in the fundamental representation Tr(r“r^) = With this we see that 

(H) is invariant under Q = + Y, which we identify as the generator of an 

unbroken U{1) gauge group, electromagnetic charge. The field strength tensors 
for Afj,, and are denoted as G^y, W^y, and B^y, respectively, and that of 
electromagnetism by F^j^y. 

The Lagrangian of the SM is the most general combination of monomials con¬ 
structed out of these fields constrained by (i) Lorentz invariance, (ii) Gauge in¬ 
variance, and (iii) renormalizability. This last one implies that this monomials, 
or “operators,” are of dimension no larger than four. Field redefinitions by linear 
transformations that preserve Lorentz and gauge invariance bring the kinetic terms 
to canonical form. The remaining terms are potential energy terms, either Yukawa 
interactions or BEH-field self-couplings. The former are central to our story: 


- ^Yuk = ^ ^^u)HqLiU^ji + \D]HqLi(Rj^ + XE^Hiue^j^ + h.c. 
hi 


(1.5) 


We will mostly avoid explicit index notation from here on. The reason for upper 
and lower indices will become clear below. The above equation can be written 
more compactly as 

- >CYuk = HqLXuUR + HqLXndR + HIeXe^r + h.c. (1.6) 


Flavor “symmetry.” In the absence of Yukawa interactions {i.e., setting Xjj = 
Xd = Xe = 0 above) the SM Lagrangian has a large global symmetry. This is 
because the Lagrangian is just the sum of covariantized kinetic energy therms, 
with the sum running over all the fields in irreducible representations 
of the the SM gauge group, and one can make linear unitary transformations among 
the helds in a given SM-representation without altering the Lagrangian: 

Ql ^ Uq qi , Ur ^ Uu Ur , ... eR ^ Ue eR , 

where UqJJq = ••• = UeUe = 1. Since there are Nf = 3 copies of each SM- 
representation this means these are Nf x Nf matrices, so that for each SM- 
representation the redefinition freedom is by elements of the group U{Nf). Since 
there are five distinct SM-representations (3 for quarks and 2 for leptons), the full 
symmetry group is U{NfY = 17(3)^.^ In the quantum theory each of the U{1) fac- 

^Had we kept indices explicitly we would have written —>■ Uqj , ujj —>■ Uu] ujj ,..., ejj —>■ 
UTj ejj. The fields transform in the fundamental representation of SU{Nf). We use upper indices 
for this. Objects, like the hermitian conjugate of the fields, that transform in the anti-fundamental 
representation, carry lower indices. The transformation matrices have one upper and one lower 
indices, of course. 
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tors (corresponding to a redefinition of the Nf fields in a given SM-representation 
by multiplication by a common phase) is anomalous, so the full symmetry group 
is smaller. One can make non-anomalous combinations of these f7(l)’s, most fa¬ 
mously B — L, a symmetry that rotates quarks and leptons simultaneously, quarks 
with —1/3 the phase of leptons. For our purposes it is the non-abelian factors that 
are most relevant, so we will be happy to restrict our attention to the symmetry 
group SU{Nf)^. 

The flavor symmetry is broken explicitly by the Yukawa interactions. We can 
keep track of the pattern of symmetry breaking by treating the Yukawa couplings as 
“spurious,” that is, as constant fields. For example, under SU{Nf)q x SU{Nf)u the 
first term in (1.6) is invariant if we declare that Xjj transforms as a bi-fundamental, 
Xu —)■ UqXuUl; check: 

QlXuur —> QLUl{UqXuUl)UuUR = quXuUR. 

So this, together with Xd —>• UqXoU^ and Xr —>• LI^XeUI renders the whole La- 
grangian invariant. 

Why do we care? As we will see, absent tuning or large parametric suppression, 
new interactions that break this “symmetry” tend to produce rates of flavor trans¬ 
formations that are inconsistent with observation. This is not an absolute truth, 
rather a statement about the generic case. 

In these lectures we will be mostly concerned with hadronic flavor, so from here 
on we focus on the Gp = SU{3)^ that acts on quarks. 


1.3 The CKM matrix and the KM model of CP-violation 


Replacing the BEH field by its VEV, Eq. (1.3), in the Yukawa terms in (1.6) we 
obtain mass terms for quarks and leptons: 


V _ V - V _ 

— Tm = -^j^lXuur T -^duXodR -L -^cuXecr h.c. 


(1.7) 


Eor simpler computation and interpretation of the model it is best to make fur¬ 
ther field redefinitions that render the mass terms diagonal while maintaining the 
canonical form of the kinetic terms (diagonal, with unit normalization). The field 
redefinition must be linear (to maintain explicit renormalizability of the model) 
and commute with the Lorentz group and the part of the gauge group that is un¬ 
broken by the electroweak VEV (that is, the U{\) x SU{3) of electromagnetism 
and color). This means the linear transformation can act to mix only quarks with 
the same handedness and electric charge (and the same goes for leptons): 


Ur —^ VujiUr, Ur —)• Vuj^ul, dp —)• Vdj^dR, du —)• Vd^di- ( 1 - 8 ) 
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Finally, the linear transformation will preserve the form of the kinetic terms, say, 
ULi$UL {uLV^j^)i${VuLUL) = UL{Vil^Vui^)i0UL, if = 1, that is, if it is 

unitary. 

Now, choose to make a redefinition by matrices that diagonalize the mass terms, 
VV'jV,, = >-'u. = \'o . (1.9) 

Here the matrices with a prime, X'jj and A^, are diagonal, real and positive. 


Exercises 

Exercise 1.3-1: Show that this can always be done. That is, that an arbitrary matrix 
M can be transformed into a real, positive diagonal matrix M' = by a pair of 

unitary matrices, P and Q. 


Then from 



+ dzjXjydji + ciXegr + h.c. 


we read off the diagonal mass matrices, mu = 
rriE = vXeIV^- 


I — —T dX^d cXe^^ 

( 1 . 10 ) 

vXu / \/2) mu = vVjyl^/2 and 


Since the field redefinitions in (1.8) do not commute with the electroweak 
group, it is not guaranteed that the Lagrangian is independent of the matrices 
, • • •, Vdji . We did choose the transformations to leave the kinetic terms in 
canonical form. We now check the effect of (1.8) on the gauge interactions. Con¬ 
sider first the singlet fields ur. Under the field redefinition we have 


Ur {gsA°‘T°'+‘^gi$)uR urV^^ {gsA°'T°‘ +'^gi$)VujiUR = ur {gsA°‘T°'+\gi$)uR ■ 


It remains unchanged. Clearly the same happens with the dR fields. The story 
gets more interesting with the left handed fields, since they form doublets. First 
let’s look at the terms that are diagonal in the doublet space: 


quigs^'T^ + \g2Ml''cj^ + \gx$)qL 

= uuigsA'T^ + \g2Mx'" + \gx$)uL + h{gsA"T^ - \g2Mi'' + \gx$)dL 

This is clearly invariant under (1.8). Finally we have the off-diagonal terms. For 
these let us introduce 




cr ± 


and = 


=F iW^ 


\/2 ’ ^/2 

so that cr^VF^ -|- -j- a~W~ and (o'~^)i 2 = \/2, (o'~) 2 i = V^, and all 

other elements vanish. It is now easy to expand: 


qLy2(cr^W^ -h a^W^)qL = ■^g2ULW^du + -^g2dLW ur 

^ j:^g2UL{VlVj]^^dL + (1.11) 
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A relic of our field redefinitions has remained in the form of the unitary matrix 
V = Vui^Vdj^. We call this the Cabibbo-Kobayashi-Maskawa (CKM) matrix. 

A general unitary 3x3 matrix has 3^ complex entries, constrained by 3 complex 
plus 3 real conditions. So the CKM matrix is in general parametrized by 9 real 
entries. But not all are of physical consequence. We can perform further transfor¬ 
mations of the form of (1.8) that leave the mass matrices in (1.9) diagonal and non¬ 
negative if the unitary matrices are diagonal with 14 ^ = 14 ^ = diag(e®“i, e®"^) 

and 14i = VdR = diag(e*^4 e*^®). V is redefined by Vij —> These 

five independent phase differences reduce the number of independent parameters 
in K to 9 — 5 = 4. It can be shown that this can in general be taken to be 3 rotation 
angles and one complex phase. It will be useful to label the matrix elements by 
the quarks they connect: 


V = 


Vud 

Vus 

Vub\ 

Vcd 

Vcs 

Vcb 

Vtd 

Vts 

Vth 


Observations: 

1. That there is one irremovable phase in V impies that CP is not a symmetry 
of the SM Lagrangian. It is broken by the terms di + ul. 

To see this, recall that under CP UL’^^di —>■ and —)■ —W~. 

Hence CP invariance requires = V. 


Exercises 

Exercise 1.3-2: In QED, charge conjugation is e^^e —>■ —ey^e and —>■ —A^. 

So e^e is invariant under C. 

So what about QCD? Under charge conjugation —>■ g(—T“)^ 7 ^g, but 

(—j^ajT _ equal —T“ (nor T“). So what does charge conju¬ 

gation mean in QCD? How does the gluon field, A“, transform? 

Exercise 1.3-3: If two entries in mu (or in mo) are equal show that V can 
be brought into a real matrix and hence is an orthogonal transformation (an 
element of 0(3)). 


2. Precise knowledge of the elements of V is necessary to constrain new physics 
(or to test the validity of the SM/CKM theory). We will describe below how 
well we know them and how. But for now it is useful to have a sketch that 
gives a rough order of magnitude of the magnitude of the elements in V: 


V ~ 




with e 10“\ 


( 1 . 12 ) 




CHAPTER 1. FLAVOR THEORY 


3. Since VV^ = V^V = 1 the rows as well as the columns of V are orthonormal 
vectors. In particular, Kk^/k = 0 for j / i. Three complex numbers 
that sum to zero are represented on the complex plane as a triangle. As the 
following table shows, the resulting triangles are very different in shape. Two 
of them are very squashed, with one side much smaller than the other two, 
while the third one has all sides of comparable size. As we shall see, this will 
play a role in understanding when CP asymmetries can be sizable. 


0 



shape 

(normalized to unit base) 

12 


e + e + e^ = 0 

- -- 1 e 

23 

VcdVu + VcsVtl + VM = 0 

-h -h = 0 


13 

Vu^Vm + VusVtl + = 0 

-h -h = 0 



These are called “unitarity triangles.” The most commonly discussed is in 
the 1-3 columns, 


VudV:b+K,v:,+vM = o => 

1 



Dividing by the middle term we can be more explicit as to what we mean by 
the unit base unitarity triangle: 




= 0 


We drew this on the complex plane and introduced some additional notation: 
the complex plane is z = p + if] and the internal angles of the triangle are^ 
a, j3 and 7; see Fig. 1.1. 

The angles of the unitarity triangle, of course, are completely determined by 
the CKM matrix, as you will now explicitly show: 

^This convention is popular in the US, while in Japan a different convention is more common: 
(j)i = P, (j)2 = a and 03 = 7. 
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Figure 1.1: Unitarity triangle in the p-rj plane. The base is of unit length. The 
sense of the angles is indicated by arrows. 


Exercises 


Exercise 1.3-4: Show that 


(i) 

(ii) 


/3 = arg - 


VcdVcl 

VuVtl 


, a = arg 


VtdVtl 


and 7 = arg ( — 


VudV:b 

ycdV:^ 


These are invariant under phase redefinitions of quark fields (that is, under 
the remaining arbitrariness). Hence these are candidates for observable 
quantities. 




(hi) The area of the triangle is -3 ImfTf = - 2 m (F„^V;;F^fcF4). 

cd CO I cd CD I 

(iv) The product J = Inn (a “Jarlskog invariant”) is invariant 

under re-phasing of quark fields. 


Note that Im = J{6ijSki — SuSkj) is the common area of all the 

un-normalized triangles. The area of a normalized triangle is J divided by the 
square of the magnitude of the side that is normalized to unity. 


4. Parametrization of V: Since there are only four independent parameters in 
the matrix that contains 3x3 complex entries, it is useful to have a com¬ 
pletely general parametrization in terms of four parameters. The standard 
parametrization can be understood as a sequence of rotations about the three 
axes, with the middle rotation incorporating also a phase transformation; 


V = CBA 
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IP- 





Figure 1.2; Experimentally determined unitarity triangles [1]. Upper pane: “fat' 
1-3 columns triangle. Lower pane: “skinny” 2-3 columns triangle. 
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where 

/ C12 S12 0\ / C13 0 sise-*^ /I 0 0 \ 

A = j —Si2 Ci2 0 I , ^ = I 0 1 0 1 ) C = j 0 C23 S23 j • 

VO 0 1/ V-Sl3e*'^ 0 C13 / VO -S23 023/ 

Here we have used the shorthand, Cij = cos 9ij, Sij = sin Oij, where the angles 
6ij all lie on the first quadrant. From the phenomenologically observed rough 
order of magnitude of elements in V in (1.12) we see that the angles 9^ are 
all small. But the phase 5 is large, else all triangles would be squashed. 

An alternative and popular parametrization is due to Wolfenstein. It follows 
from the above by introducing parameters A, A, p and rj according to 

si 2 = A, S23 = AA^, sise"^ = A\^{p + iri) (1.13) 


The advantage of this parametrization is that if A is of the order of e, while 
the other parameters are of order one, then the CKM has the rough order 
in (1.12). It is easy to see that p and p are very close to, but not quite, the 
coordinates of the apex of the unitarity triangle in Fig. 1.1. One can adopt 
the alternative, but tightly related parametrization in terms of A, A, p and p: 


Si2 — A, S23 — AA^, Sl3e*^ 


AA^(p + ip) 


yi - A2A4 

y/l — A 2 [l — A‘^X‘^{p + ip)] 


Exercises 

Exercise 1.3-5: (i) Show that 


ip = 




hence p and p are indeed the coordinates of the apex of the unitarity 
triangle and are invariant under quark phase redefinitions. 

(ii) Expand in A <C 1 to show 


1 \2 


V = 


1-^A 
-A 

yAA^(l — p — ip) 


A AX^{p — ip)\ 

1 - iA^ AA2 + OiXy 
-AA^ 1 / 


1.4 Once more on Flavor Symmetry 

Suppose we extend the SM by adding terms (local, ^ Lorentz and gauge invariant) 
to the Lagrangian. Since the SM already includes all possible monomials (“opera¬ 
tors”) of dimension 4 or smaller, we consider adding operators of dim > 5. We are 

®By “local” we mean a product of fields all evaluated at the same spacetime point. 
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going to impose an additional constraint, and we will investigate its consequence. 
We will require that these operators be invariant under the flavor transformations, 
comprising the group Gp- 


Ql Uq qi , ur^ Uu Ur , dR^ Ud dR , 
We add some terms to the Lagrangian 


Xu^UqXuUl XD^UqXnUl 

(1.14) 


C^C+AC 


A£ = ^ aOi 

i 


with, for example. 


Oi = G^HuRT^a^^’^XuQL, 

O2 = qL'X^X}jjXjjqR dR'y^Xj^X}j^dR. 

Here is the field strength for the SU (3)c gauge field (which is quite irrelevant 
for our discussion, so don’t be distracted). Consider these operators when we rotate 
to the basis in which the mass matrices are diagonal. Start with the first: 

= a;,HuRTV'\[, 

We see that the only flavor-changing interaction is governed by the off-diagonal 
components of X'jjV. Similarly 

O 2 qLl^{>^uf<i'LdRli,{>^DfdR, where qp = • 
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Exercises _ 

Exercise 1.4-6: Had we considered an operator like Oi but with Hdn instead of 
Hur the flavor off-diagonal terms would have been governed by AdEI. Show this 
is generally true, that is, that flavor change in any operator is governed by V and 
powers of A'. 

Exercise 1.4-7: Exhibit examples of operators of dimension 6 that produce flavor 
change without involving Xu,d- Can these be such that only quarks of charge -1-2/3 
are involved? (These would correspond to Flavor Changing Neutral Currents; see 
Sec. 1.5 below). 


This construction, restricting the higher dimension operators by the flavor sym¬ 
metry with the Yukawa couplings treated as spurious, goes by the name of the 
principle of Minimal Flavor Violation (MFV). Extensions of the SM in which the 
only breaking of Gp is by Xjj and Xp automatically satisfy MFV. As we will see 
they are much less constrained by flavor changing and CP-violating observables 
than models with generic breaking ol Gp- Let’s consider some examples: 


1. The supersymmetrized SM. I am not calling this the MSSM, because the 
discussion applies as well to the zoo of models in which the BEH sector has 
been extended, e.g., the NMSSM. In the absence of SUSY breaking this model 
satisfies the principle of MFV. The Lagrangian is 

£ = y [Qe^Q -b Ue^U -b De^D^^ -bgauge & H kinetic terms-b Jd^6W-\-h..c. 
with superpotential 


W = HiUyuQ + H 2 Dyp)Q -b non-quark-terms 

Here V stands for the vector superfields'^ and Q, D, [/, Hi and H 2 are chiral 
superfields with the following quantum numbers: 


(3, 2 ) 1/6 
U ~ (3, l)_2/3 
D ~ (3, 1 ) 1/3 


Hi ~ ( 1 , 2)1/2 

H 2 ~ ( 1 , 2 )_ i /2 


The fields on the left column come in three copies, the three generations we 
call flavor. We are again suppressing that index (as well as the gauge and 
Lorentz indices). Unlike the SM case, this Lagrangian is not the most gen¬ 
eral one for these fields once renormalizability, Lorentz and gauge invariance 

'^Since I will not make explicit use of vector superfields, there should be no confusion with the 
corresponding symbol for the the CKM matrix, which is used ubiquitously in these lectures. 
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are imposed. In addition one needs to impose, of course, supersymmetry. 
But even that is not enough. One has to impose an /^-symmetry to forbid 
dangerous baryon number violating renormalizable interactions. 

When the Yukawa couplings are neglected, yu = Ud = 0, this theory has 
a SU{3)^ flavor symmetry. The symmetry is broken only by the couplings 
and we can keep track of this again by treating the couplings as spurions. 
Specihcally, under SU{3)^, 

Q^UqQ, U^SuU, D^SdD, yu^SljyuUl yo ^ ShynUl 

Note that this has both quarks and squarks transforming together. The 
transformations on quarks may look a little different than the transformation 
in the SM, Eq. (1.14). But they are the same, really. The superhcial difference 
is that here the quark helds are all written as left-handed fields, which are 
obtained by charge-conjugation from the right handed ones in the standard 
representation of the SM. So in fact, the couplings are related by yu = and 
yn = A^, and the transformations on the right handed fields by Su = U* and 
Sd = U^. While the relations are easily established, it is worth emphasizing 
that we could have carried out the analysis in the new basis without need 
to connect to the SM basis. All that matters is the way in which symmetry 
considerations restrict certain interactions. 

Now let’s add soft SUSY breaking terms. By “soft” we mean operators of 
dimension less than 4. Since we are focusing on flavor, we only keep terms 
that include fields that carry flavor: 

A£susY-bkg = 

+ {4’hi4’u9u4’q + 4’h24'dgD4’q + h.C.) (1.15) 

Here (px is the scalar SUSY-partner of the quark X. This breaks the flavor 
symmetry unless oc 1 and gu,D oc yu,D (see, however, Exercise 1.4- 

8 ). And unless these conditions are satished new flavor changing interactions 
are generically present and large. The qualiher “generically” is because the 
effects can be made small by lucky coincidences (fine tunings) or if the masses 
of scalars are large. 

This is the motivation for gauge mediated SUSY-breaking [2]: 


SUSY 
breaking sector 


gauge 


interaction 


SUSY SM 
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The gauge interactions, e.g., Qe^Q, are diagonal in flavor space. In theo¬ 
ries of supergravity mediated supersymmetry breaking the flavor problem is 
severe. To repeat, this is why gange mediation and its variants were invented. 

2. MFV Fields. Recently CDF and DO reported a larger than expected forward- 
backward asymmetry in tt pairs produced in pp collisions [3] . Ronghly speak¬ 
ing, define the forward direction as the direction in which the protons move, 
and classify the outgoing particles of a collision according to whether they 
move in the forward or backward direction. You can be more careful and 
define this relative to the CM of the colliding partons, or better yet in terms 
of rapidity, which is invariant under boosts along the beam direction. But 
we need not worry abont such subtleties: for our purposes we want to nn- 
derstand how flavor physics plays a role in this process that one would have 
guessed is dominated by SM interactions [4]. Now, we take this as an edu¬ 
cational example, but I should warn you that by the time you read this the 
reported effect may have evaporated. In fact, since the lectures were given 
DO has revised its result and the deviation from the SM expected asymmetry 
is now much smaller [5]. 

There are two types of BSM models that explain this asymmetry, classified 
according to the the type of new particle exchange that produces the asym¬ 
metry: 

(i) s-channel. For example an “axi-gluon,” mnch like a gluon but massive 
and coupling to axial currents of qnarks. The interference between vec¬ 



tor and axial currents, ^ t u t 

produces a FB-asymmetry. It turns out that it is best to have the sign 


of the axiglnon coupling to t-quarks be opposite that of the conpling to 


u quarks, in order to get the correct sign of the FB-asymmetry with¬ 
out violting constraints from direct detection at the LHC. But different 
couplings to u and t means flavor symmetry violation and by now you 
should suspect that any complete model will be subjected to severe con¬ 
straints from flavor physics. 

(ii) t-channel: for example, one may exchange a scalar, and the amplitnde 


now looks like this: 
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This model has introduced a scalar cj) with a coupling (l)iu (plus its her- 
mitian conjugate). This clearly violates flavor symmetry. Not only we 
expect that the effects of this flavor violating coupling would be directly 
observable but, since the coupling is introduced in the mass eigenba- 
sis, we suspect there are also other couplings involving the charge-+2/3 
quarks, as in (pcu and cj)iu and flavor diagonal ones. This is because 
even if we started with only one coupling in some generic basis of fields, 
when we rotate the fields to go the mass eigenstate basis we will gener¬ 
ate all the other couplings. Of course this does not have to happen, but 
it will, generically, unless there is some underlying reason, like a sym¬ 
metry. Moreover, since couplings to a scalar involve both right and left 
handed quarks, and the left handed quarks are in doublets of the elec- 
troweak group, we may also have flavor changing interactions involving 
the charge-(—1/3) quarks in these models. 

One way around these difficulties is to build the model so that it satisfies 
the principle of MFV, by design. Instead of having only a single scalar field, 
as above, one may include a multiplet of scalars transforming in some rep¬ 
resentation oi Gp. So, for example, one can have a charged scalar multiplet 
(j) transforming in the (3, 3,1) representation of SU{3)q x SU{3)u x SU{3)d, 
with gauge quantum numbers (1, 2)_^/2 with interaction term 

>^qL4>UR with (j) ^ Uq^(j)Ul^. 

Note that the coupling A is a single number (if we want invariance under 
flavor). This actually works! See [6]. 


Exercises 

Exercise 1-4-8: Below Eq. (1.15) we said, “This breaks the flavor symmetry 
unless Alq „ ^ oc 1 and gu,D oc yu,D-" This is not strictly correct (or, more 
bluntly, it is a lie). While not correct it is the simplest choice. Why? Exhibit 
alternatives, that is, other forms for ^ and gu,D that respect the symmetry. 
Hint: Read below. See (1.16). 

Exercise 1-4-9: Classify all possible dim-4 interactions of Yukawa form in the 
SM. To this end list all possible Lorentz scalar combinations you can form out 
of pairs of SM quark fields. Then give explicitly the transformation properties 
of the scalar field, under the gauge and flavor symmetry groups, required to 
make the Yukawa interaction invariant. Do this first without including the SM 
Yukawa couplings as spurious and then including also one power of the SM 
Yukawa couplings. 
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1.5 FCNC 


This stands for Flavor Changing Neutral Currents, but it is used more generally 
to mean Flavor Changing Neutral transitions, not necessarily “currents.” By this 
we mean an interaction that changes flavor but does not change electric charge. For 
example, a transition from a 6-quark to an s- or d-quarks would be flavor changing 
neutral, but not so a transition from a 6-quark to a c- or u-quark. Let’s review 
flavor changing transitions in the SM: 


1. Tree level. Only interactions with the charge vector bosons change flavor; 
cf. (1.11). The photon and Z coupe diagonally in flavor space, so these 
“neutral currents” are flavor conserving. 


For example, n —)> peu is 


d 



V 

e 


2 . 


1-loop. Can we have FCNCs at 1-loop? Say, 6 —)> sy? Answer; YES. Here is 



Hence, FCNC are suppressed in the SM by a 1-loop factor of ~ 


relative to the flavor changing charged currents. 


al ^ « 

lOvr^ 47rc^ 


Exercises 

Exercise 1.5-10: Just in case you have never computed the /r-lifetime, verify that 


Y ^ « r(/r ^ er-^z/e) 


G%ml 

1927r3 


neglecting rUe, at lowest order in perturbation theory. 

Exercise 1.5-11: Compute the amplitude for Z —>■ 6s in the SM to lowest order 
in perturbation theory (in the strong and electroweak couplings). Don’t bother to 
compute integrals explicitly, just make sure they are finite (so you could evaluate 
them numerically if need be). Of course, if you can express the result in closed 
analytic form, you should. See Ref. [8]. 
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1.6 GIM-mechanism: more suppression of FCNC in 
SM 

1.6.1 Old GIM 


Let’ s imagine a world with a light top and a hierarchy niu < rric < mt My^. 
Just in case you forgot, the real world is not like this, but rather it has mu <C 
me <C My/ ^ ^mt- We can make a lot of progress towards the computation of the 
Feynman graph for b ^ sj discussed previously without computing any integrals 
explicitly: 


E 

i=u,c,t 

and F{x) is some function that results form doing the integral explicitly, and we 
expect it to be of order 1. The coefficient of this unknown integral can be eas¬ 
ily understood. First, it has the obvious loop factor photon coupling 

constant (e) and CKM factors from the charged curent interactions. Next, 

in order to produce a real (on-shell) photon the interaction has to be of the tran¬ 
sition magnetic-moment form, F^ysa^'^b, which translates into the Dirac spinors 
u{p) for the quarks combining with the photon’s momentum q and polarization 
vector (e) through qfj_eyu{ps)(T^’^u{pb).^ Finally, notice that the external quarks 
interact with the rest of the diagram through a weak interaction, which involves 
only left-handed fields. This would suggest getting an amplitude proportional to 
u{ps) {^^-^'^u{ph) which, of course, vanishes. So we need one or the 

other of the external quarks to flip its chirality, and only then interact. A chirality 
flip produces a factor of the mass of the quark and we have chosen to flip the 
chirality of the b quark because m^ mg. This explain both the factor of mb and 
the projector acting on the spinor for the 6-quark. The correct dimensions 
are made up by the factor of 1/M^. 

Now, since we are pretending < me < mt <V. My/, let’s expand in a Taylor 


W 


u, c, t 



i{qt) 


where 


I = 


®The other possibility, that the photon field Ay couples to a flavor changing current, 
is forbidden by electromagnetic gauge invariance. If you don’t like this argument, here is an 
alternative: were you to expand the amplitude in powers of qjMz you would find the lowest order 
contribution, Cu{psV^u(ph) is absent by gauge invariance, and the leading contribution is linear 
in momentum, as exhibited. 
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series, F{x) = F{0) + xF'{0) + ■ ■ ■ 

F'{0)+■■■ 

Unitarity of the CKM matrix gives Yli=uct ^ib^is = 0 so the first term vanishes. 
Moreover, we can rewrite the unitarity relation as giving one term as a combination 
of the other two, for example, 

VM = - E 


I = 



F(0) + 







giving us 


I^-F'{0) Y, V,,V* 

i=u,c 


2 2 

K- 


We have uncovered additional FCNC suppression factors. Roughly, 




mt - ml 


PVcbVr 


2 2 
— 771 ^ 


2 2 

mt 9 mr 


a4 


a4' 


So in addition the 1-loop suppression, there is a mass suppression {rn^/M^) and 
a mixing angle suppression (e^). This combination of suppression factors was un¬ 
covered by Glashow, Iliopoulos and Maiani (hence “GIM”) back in the days when 
we only knew about the existence of three flavors, u, d and s. They studied neu¬ 
tral kaon mixing, which involves a FCNC for s to d transitions and realized that 
theory would grossly over-estimate the mixing rate unless a fourth quark existed 
(the charm quark c) that would produce the above type of cancellation (in the 
2-generation case). Not only did they explain kaon mixing and predicted the ex¬ 
istence of charm, they even gave a rough upper bound for the mass of the charm 
quark, which they could do since the contribution to the FCNC grows rapidly with 
the mass, as shown above. We will study kaon mixing in some detail later, and we 
will see that the top quark contribution to mixing is roughly as large as that of 
the charm quark: Glashow, Iliopoulos and Maiani were a bit lucky, the parameters 
of the SM-CKM could have easily favored top quark mediated dominance in kaon 
mixing and their bound could have been violated. As it turns out, the charm was 
discovered shortly after their work, and the mass turned out to be close to their 
upper bound. 


1.6.2 Modern GIM 

We have to revisit the above story, since mt <C Mw is not a good approximation. 
Consider our example above, b —?■ sj. The function F{x) can not be safely Taylor 
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expanded when the argument is the top quark mass. However, I is invariant under 
F(x) F(x)+ constant, so we may choose without loss of generality F(0) = 0. 
Then 



We expect F(x} to be order 1. This is indeed the case, F(x} is a slowly increasing 
function of x that is of order 1 at the top quark mass. The contributions from u and 
c quarks to I are completely negligible, and virtual top-quark exchange dominates 
this amplitude. 


Exercises 

Exercise 1.6.2-1: Consider s —)■ dj. Show that the above type of analysis suggests 
that virtual top quark exchange no longer dominates, but that in fact the charm and 
top contributions are roughly equally important. Note: For this you need to know the 
mass of charm relative to Mw ■ If you don’t, look it up! 


1.6.3 Bounds on New Physics, GIM and MFV 


Now let’s bring together all we have learned. Let’s stick to the process b —)• sy, 
which in fact places some of the most stringent constraints on models of new physics 
(NP). Let’s model the contribution of NP by adding a dimension 6 operator to the 
Lagrangian,® 

C pvC 

A£ = —eF^.HqLcy^^'^hR = -^F^,SLaf^‘'bR + ■■■ 


I have assumed the left handed doublet belongs in the second generation and have 
gone to unitary gauge. The coefficient of the operator is C/A^: C is dimensionless 
and we assume it is of order 1, while A has dimensions of mass and indicates 
the energy scale of the NP. It is easy to compute this term’s contribution to the 
amplitude. It is even easier to roughly compare it to that of the SM, 


AInp 

-4sm 


vC 

V2A2 


IT/ T/* I a mh 

' tb ts I 


®The field strength should be the one for weak hypercharge, and the coupling constant should 
be gi. This is just a distraction and does not affect the result; in the interest of pedagogy I have 
been intentionally sloppy. 
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Require this ratio be less than, say, 10%, since the SM prediction agrees at that 
level with the measurement. This gives. 


> 


VMt,rS 


2 „2 


W^W 


0.1 


A > 70 TeV. 


This bound is extraordinarily strong. The energy scale of 70 TeV is much higher 
than that of any existing or planned particle physics accelerator facility. 

In the numerical bound above we have taken C ~ 1, but clearly a small coeffi¬ 
cient would help bring the scale of NP closer to experimental reach. The question 
is what would make the coefficient smaller. One possibility is that the NP is weakly 
coupled and the process occurs also at 1-loop but with NP mediators in the loop. 
Then we can expect C ~ a/dvrs^, which brings the bound on the scale of new 
physics down to about 4 TeV. 

Now let’s consider the effect of the principle of MFV. Instead of a single operator 
we take a collection of operators and make a flavor invariant (when we include 
spurious). Our first attempt is 


AC = -^eFfj_uHqL\D(^^'^dR 


which has the same form, as far as flavor is concerned, as the mass term in the 
Lagrangian and therefore it gives no flavor changing interaction when we go to the 
field basis that diagonalizes the mass matrices. This is not a surprise, we have seen 
this before, in Sec. 1.4. To get around this we need to construct an operator which 
either contains more fields, which will give a loop suppression in the amplitude 
plus an additional suppression by powers of A, or additional factors of spurious. 
We try the latter. Consider, then 

AC = -^eFfj^uHqijXjjXjjXjjO'^ dji. 


When you rotate the fields to diagonalize the mass matrix you get, for the charge 
neutral quark bi-linear, 

XuX^^Xj, ^ vlx^X^^X^V,^ = (1.16) 

our estimate of the NP amplitude is suppressed much like in the SM, by the mixing 
angles and the square of the “small” quark masses. Our bound now reads 


-\/r2 „2 1 

^-1^2 > ^ ^ ^-1/2^ > 4 TeV 

^ 0.1 


This is within the reach of the LHC (barely), even if C ~ 1 which should correspond 
to a strongly coupled NP sector. If for a weakly coupled sector C is one loop 
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> 



> 

C 


10 ^ 

10 ^ 

10 ^ 


IQi 





(s —)■ d) s) (c —> u) 

ArriK, eK Arrid, sin 2/3 Arris, D-D 


Figure 1.3: Bounds on the scale of NP scale from various processes. The NP is 
modeled as dimension 6 operators. No accidental suppression of the coefficient (as 
in MFV) is included. The 6 —)> s case is consistent with the explicit 5 —)• 57 example 
worked out in these notes. The figure is taken from M. Neubert’s talk at EPS 2011. 


suppressed, A could be interpreted as a mass Mnp of the NP particles in the loop, 
and the analysis gives Mnp > 200 GeV. The moral is that if you want to build a 
NP model to explain putative new phenomena at the Tevatron or the LHC you can 
get around constraints from flavor physics if your model incorporates the principle 
of MFV or some other mechanism that suppresses FCNC. 


Exercises 

Exercise 1.6.3-1: Determine how much each of the bounds in Fig. 1.3 is weakened 
if you assume MFV. You may not be able to complete this problem if you do not 
have some idea of what the symbols AMk, ck, etc, mean or what type of operators 
contribute to each process; in that case you should postpone this exercise until that 
material has been covered later in these lectures. 


1.7 Determination of CKM Elements 

Fig. 1.2 shows the state of the art in our knowledge of the angles of the unitarity 
triangles for the 1-3 and 2-3 columns of the CKM. How are these determined? 
More generally, how are CKM elements measured? Here we give a tremendously 
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compressed description. 

The relative phase between elements of the CKM is associated with possible CP 
violation. So measurement of rates for processes that are dominated by one entry 
in the CKM are insensitive to the relative phases. Conversely, CP asymmetries 
directly probe relative phases: 

(i) |Pud| is measured through allowed nuclear transitions. The theory is fairly 
well understood (even if it is nuclear physics) because the transition matrix 
elements are constrained by symmetry considerations. 

(ii) I Pus I) \Vcd\: I Pcs I) |Pcb|) are primarily probed through semi-leptonic de¬ 
cays of mesons, M —)• M'lv {e.g., K~^ —)• The theoretical difficulty 

is to produce a reliable estimate of the rate, in terms of the CKM matrix 
elements, in light of the quarks being strongly bound in a meson. To ap¬ 
preciate the theoretical challenge consider the decay of a pseudoscalar meson 
to another pseudoscalar meson. The weak interaction couples to a “P — A" 
(P=vector, y4=axial) hadronic current, V’^(7^ ~ 7^75)V’) and a corresponding 
leptonic current. The latter, being excluded from the strong interactions, 
offers no difficulty and we can immediately compute its contribution to the 
amplitude. The contribution to the amplitude from the hadronic side then 
involves 

(p'IPIp) = U{q^){p + p'Y + 

The bra and ket stand for the meson final and initial states, characterized 
only by their momentum and internal quantum numbers, which are implicit 
in the formula. The matrix element is to be computed non-perturbatively 
with regard to the strong interactions. Only the vector current (not the 
axial) contributes, by parity symmetry of the strong interactions. The matrix 
element must be a vector, by Lorentz covariance, and it is written as a linear 
combination of the only two vectors, p and p'. In the 3-body decay, p = p' + q 
so q is the sum of the momentum of the lepton pair. The coefficients of the 
expansion, or “form factors,” are functions of the invariants we may form 
out of these vectors. There is only one kinematic variable one can form, 
p ■ p' , because p^ and p'^ are just the fixed square-masses of the mesons. 
It is conventional to write the form factors as functions of Y- When the 
term f-{qYq^ is contracted with the leptonic current one gets a negligible 
contribution, q-{y—A) ~ at least when i = eoi p. So the central problem 
is to determine /+. Symmetry considerations can produce good estimates of 
/+ at specific kinematic points, which is sufficient for the determination of 
the magnitude of the CKM. Alternatively one may determine the form factor 
using Monte Carlo simulations of QCD on the lattice. 



24 


CHAPTER 1. FLAVOR THEORY 


Exercises 

Exercise 1.7-2: Show that q ■ {V — A) ^ me for the leptonic charged current. 

Be more precise than 

For example, if both states are pions then the vector current is the ap¬ 
proximately conserved current associated with isospin symmetry. This gives 
/+(0) ~ 1. One can repeat this for kaons and pions, where the symmetry 
now is Gell-Mann’s SU{3), in which the u, d and s quarks form a triplet. 
The pions and kaons, together with the eta particle form an octet of SU{3). 
In the symmetry limit one then still has /+(0) = 1, but now the symmetry 
is not as good as in the isospin case. Since the largest source of symme¬ 
try breaking is the mass of the strange quark mass, one expects corrections 
/+(0) — 1 oc rris/A, with A a hadronic scale, presumably A ~ 1 GeV. This 
seems like bad news, an uncontrolled 10% correction. Fortunately, by a the¬ 
orem of Ademolo and Gatto, the symmetry breaking parameter appears at 
second order, /+(0) — 1 oc (rUs/A)^ ~ 1%. We cannot extend this to the 
heavier quarks because then rUc/A > 1 is a bad expansion parameter. Re¬ 
markably, for transitions among heavy quarks there is another symmetry, 
dubbed “Heavy Quark Symmetry,” that allows similarly successful predic¬ 
tions; for a basic introduction see [9]. For heavy to light transitions one 
requires other methods, like lattice QCD, to determine the remaining GKMs. 

The green ring in Fig. 1.2 shows the region of the p-f/ plane allowed by the 
determination of \Vub\- More precisely, note that -s/-\- rf = |14b/14s%cb| so 
that the ring requires the determination of the three CKM elements. It is 
labeled “|l%fe|” because this is the least accurately determined of the three 
CKM elements required. 

(hi) Neutral Meson Mixing. Next chapter is devoted to this. It gives, for example, 
in the case of Bd mixing and for Bg mixing. And the case of 

RO mixing is, as we will see, fascinatingly subtle and complex. The yellow 
(“Am^”) and orange & Am^”) circular rings centered at (1,0) in 

Fig. 1.2 are determined by the rate of Bd mixing and by the ratio of rates 
of Bd and Bg mixing, respectively. The ratio is used because in it some 
uncertainties cancel, hence yielding a thiner ring. The bright green region 
labeled sk is determined by CP violation in K^-K mixing. 

(iv) CP asymmetries. Decay asymmetries, measuring the difference in rates of 
a process and the CP conjugate process, directly probe relative phases of 
CKM elements, and in particular the unitarity triangle angles a, [3 and 7 . 
We will also study these, with particular attention to the poster boy, the 
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determination of sin(2/3) from Bfi —)• 'ipKs, which is largely free from hadronic 
uncertainties. In Fig. 1.2 the blue and brown wedges labeled sin 2/3 and 7 , 
respectively, and the peculiarly shaped light blue region labeled a are all 
obtained from various CP asymmetries in decays of mesons. 


Chapter 2 


Neutral Meson Mixing and CP 
Asymmetries 

2.1 Why Study This? 

Yeah, why? In particular why bother with an old subject like neutral-A meson 
mixing? I offer you an incomplete list of perfectly good reasons: 

(i) CP violation was discovered in neutral-A meson mixing. 

(ii) Best constraints on NP from flavor physics are from meson mixing. Look at 

Fig. 1.3, where the best constraint is from CP violation in neutral-iL mixing. 
In fact, other than all of the other observables in the figure involve 

mixing. 

(iii) It’s a really neat phenomenon (and that should be sufficient reason for want¬ 
ing to learn about it, I hope you will agree). 

(iv) It’s an active field of research both in theory and in experiment. I may be 
just stating the obvious, but the LHCb collaboration has been very active 
and extremely successful, and even CMS and ATLAS have performed flavor 
physics analysis. And, of course, there are also several non-LHC experiments 
ongoing or planned; see, e.g., [10]. 

But there is another reason you should pay attention to this, and more generally 
to the “phenomenology” (as opposed to “theory” or “model building”) part of these 
lectures. Instead of playing with Lagrangians and symmetries we will use these to 
try to understand dynamics, that is, the actual physical phenomena the Lagrangian 
and symmetries describe. If you are a model builder you can get by without an 
understanding of this. Sort of. There are enough resources today where you can 
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plug in the data from your model and obtain a prediction that can be tested against 
experiment. Some of the time. And all of the time without understanding what 
you are doing. You may get it wrong, you may miss effects. As a rule of thumb, 
if you are doing something good and interesting, it is novel enough that you may 
not want to rely on calculations you don’t understand and therefore don’t know if 
applicable. Besides, the more you know the better equipped you are to produce 
interesting physics. 


2.2 The e parameter 

We start our discussion of mixing by concentrating on the neutral K system. It will 
be straightforward to carry the formalism over to the other neutral meson systems, 
D^, = Bd and Bg. Although they are all based on the same physics, each has 

its own peculiarities. Moreover it is a historical inconvenience that the notation 
and conventions used by the different communities that study these mesons differ 
unnecessarily. So we have to start somewhere, and we choose the historically 
important K^-K system. 

Consider the “weak” eigenstates K^-K , These are really flavor eigenstates in 
the sense that has the quantum numbers of a d quark and an s-antiquark, 
(sd), and K = (ds). Note the peculiar choice the strange meson, contains 
an anti-strange quark, and carries strangeness —1. By “strangeness” we mean a 
U{1) group that rotates sl and sr (in the diagonal mass basis) by a common 
phase. These states are related by CP, charge conjugation changes particles into 
anti-particles and parity turns a pseudoscalar into minus itself: 

CP\K^) = -|F°) CP\K^) = -|A°) 


Now we want to study the time evolution of these one particle states. We can 
always go to the rest frame, and since we do not involve many-particle states 
regardless of their quantum numbers we can model the time evolution by a two- 
state Schrodinger equation with a 2 x 2 Hamiltonian. Of course, since these one 
particle states may evolve into states that are not accounted for in the two state 
Hamiltonian, the evolution will not be unitary and the Hamiltonian will not be 
Hermitian. Keeping this in mind we write, for this effective hamiltonian 


H = M - 



/ M - |r 

\m*i2 - 


Mi 2 - |ri2\ 
M-iT; 


( 2 . 1 ) 


where M'^ = M and = T. 



28 


CHAPTER 2. NEUTRAL MESON MIXING AND CP ASYMMETRIES 


Exercises 

Exercise 2.2-3: Show that CPT implies Hu = H 22 . Note: If you want to test CPT 
you relax this constraint. See Ref. [11]. 

CP invariance requires M ^2 — -^12 and r]i 2 = ri 2 - Therefore either lm.Mi 2 7 ^ 0 
or Imri2 7 ^ 0 signal that CP is violated. Now, to study the time evolution of the 
system we solve Schrodinger equation. To this end we first solve the eigensystem 
for the effective Hamiltonian. We define the eigenvalues to be 


Mk, - = M - |r ± i(AM - lAT) 

s s 

and the corresponding eigenvectors are 


Kl) 

s 


1 

VW+W) 


(l + e)|i^°)±(l 




( 2 . 2 ) 


If e = 0 then these are CP-eigenstates: CP\Kl) = —\Kl) and CP\Ks) = \Ks). 
Since CP| 7 r 7 r)£=o = |vr7r)£=o and C'P|7r7r7r)£=o = —| 7 r 7 r 7 r)£=o we see that if CP were 
a good symmetry the decays Kl ^ vrTrvr and Ks —>• are allowed, but not so the 
decays —>■ tttt and Ks —)• Trvrvr. Barring CP violation in the decay amplitude, 

observation of Kl —)• vrTr or Ks —)• vrTTTr indicates e 7 ^ 0, that is, CP-violation in 
mixing. 

This is very close to what is observed: 


Br(P 5 ^ TTTr) = 100.00 ± 0.24% 

Br(P:L^7r7r) = 0.297 ±0.023% (2.3) 

Br(P'^ —)■ Trvrvr) = 33.9 ± 1.2% 

Hence, we conclude (i) e is small, and (ii) CP is not a symmetry. Notice that 
3m.n- ~ 3(140) MeV = 420 MeV while mK ~ 490 MeV, leaving little phase space 
for the decays K —)• vrvrvr. This explains why Pl is much longer lived than Ks] the 
labels “L” and “S” stand for “long” and “short,” respectively: 

tks = 0.59 X 10“^° s 
tkl = 5.18 X 10“^ s 

For the and Bs mesons the approximate CP eigenstates have a 

large number of decay channels available, many consisting of two particle states 
with much lower masses than the decaying particle and phase space suppression is 
negligible. The widths of the eigenstates are comparable so it makes no sense to 
call them “long” and “short.” Instead they are commonly referred to as heavy and 
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light, Xh and Xl (with X = or Bg), even though the mass differences are 

very small too. 


Exercises 

Exercise 2.2-4-' la (2-3) the branching fraction Br(it '5 —>■ tttt) is nearly 100% while 
Br{KL —>■ TTTrTr) is only about 34%. Explain. Hint: Consult the PDG. 


Measuring e. The semileptonic decay CP-asymmetry is 

_ T{Kl —t 7r“e+z^) - T{Kl —t 7r+e“z7) _ |1 + ep - |1 - ep ^ 

T{Kl —)■ 7r“e+z^) + T{Kl —t 7r+e“z7) |1 + ep + |1 — 

The first equal sign dehnes it, the second one computes it (assuming CP symmetry 
in the decay amplitude) and the last one approximates it (|e| <C 1). Experimental 
measurement gives (5exp = 0.330 ± 0.012%, from which Ree ~ 1.65 x 10“^. 


2.2.1 Formulas for e 


Eventually we will want to connect this effective 2x2 hamiltonian to the underlying 
fundamental physics we are studying. This can be done using perturbation theory 
(in the weak interactions) and is an elementary exercise in Quantum Mechanics 
(see, e.g., Messiah’s textbook, p.994 ~ 1001 [12]). With \K^) = |1) and \K^) = |2) 
one has 


Mij = M6ij + {i\H\j) + 2 ^ PP- jfZTE - 


+ 


Py = 2ttY^ 5{M - En){i\R\n){n\H\j) + • • • 


(2.4) 

(2.5) 


Here the prime in the summation sign means that the states 11) and |2) are excluded 
and PP stands for “principal part.” Beware the states are normalized by {i\j) = 5ij 
rather than (p'jp) = — (let alone {p'\p) = 2E6^^\p — p')). Also, H is 

a Hamiltonian, not a Hamiltonian density R; H = f d^xR. It is the part of the 
SM Hamiltonian that can produce flavor changes. In the absence of H the states 
\K^) = |1) and \K ) = |2) would be stable eigenstates of the Hamiltonian and 
their time evolution would be by a trivial phase. It is assumed that this flavor¬ 
changing interaction is weak, while there may be other much stronger interactions 
(like the strong one that binds the quarks together). The perturbative expansion 
is in powers of the weak interaction while the matrix elements are computed non- 
perturbatively with respect to the remaining interactions. Of course the weak 
flavor changing interaction is, well, the Weak interaction of the electroweak model, 
and below we denote the Hamiltonian by Hyj. 
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To make the full connection with more fundamental physics, we need a formula 
for e in terms of the effective hamiltonian. We start with an exercise: 


Exercises 

Exercise 2.2.1-1: Show that e is given by (the solution to) 

1 + e ^Mi 2 -|ri 2 lAM-|Ar 
1 - e “ AM - |Ar “ 2 M *2 - ir *2 

where AM and AF can be themselves determined from (the solution to) 

(AM)2 - i(Ar)^ = 4|Mi2p - |ri 2 p and AMAF = 4Re(Mi2rj2) 

Exercise 2.2.1-2: If CP is conserved show that AM = 2ReMi2 and AF = 2ReFi2. 

As we have seen, empirically, e is not vanishing but, still, it is small, so it is 
natural to assume 


ImMi 2 ReMi 2 and Imri 2 <C Reri 2 . 

Then, to linear order in e 

ImMi2 - 2lmri2 

e ss ^^- 

AM - |Ar 

We will see shortly that Imri 2 <C ImMi 2 . Also, empirically AT ~ —2AM; more 
precisely, arg(AM + |Ar) = 46.2° which we will approximate as 7r/4. We finally 
arrive at the semi-empirical formula 

_ ImMi2 
y/2AM 

This looks like a peculiar formula, mixing derived quantities with more fundamental 
ones (after using empirical input!). So an explanation is in order. We will explain in 
great detail that the imaginary part of M 12 is dominated by short distance physics 
(I will explain what is meant by that) and one can derive nice closed form formulas 
for it. On the other hand AM ~ 2ReMi2 is hard to compute, is complicated by 
long distance physics, cares little about CP violation, and is measured accurately, 
so we can just use the value. 

2.3 The e' parameter. 

e measures the amount of CP admixture of the CP even and odd would-be CP- 
eigenstates; A/, can decay into Trvr because it contains a small “contamination” of 


( 2 . 6 ) 
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the CP-even component. But Kl can also decay into vrvr even in the absence of this 
contamination if there is CP violation in the decay process. If this were absent, so 
all the CP violation would be from mixing, we would expect 

Kl —)■ TT^iT~ Ks —)■ 7r'''7r“ 

Kl ^ TT^TT^ Ks —?• Tr^TT^ 

In other words, both Kl and Ks decays are from the common CP-even compo¬ 
nent. But if the CP-odd component can also decay to tttt then the equality is not 
guaranteed, the CP-even and CP-odd components may decay into the charge and 
neutral pions with different relative rates. Violations of this relation are measured 
by the parameter e'. Let 

_ (7r+7r"|^^|iLi) _ {7^^7^^\n^a\KL) 

- (7r+7r-|il^^|iL5) '''' “ {Tr^TT^ln^lKs)' 

It is standard practice to parametrize the decay amplitudes in terms of 2-pion 
states with dehnite isospin. Define 

^j7r7r{I)\n^\K%^ = AjP^^ (2.7) 

where Sj are the (measured) final states phase shifts for the s-wave 2-pion states 
of isospin I, |7r7r(/))out = e^*^^|7r7r(/))in. Aj is in general complex, but it is con¬ 
ventional to redefine \K^) —>• e*“|iL®) to absorb the phase in Aq. So we take 

Im(^o) = 0. 


Exercises 

Exercise 2.3-3: Show that if CP is respected by the decay amplitude, then ? 7 _|__ = 
Vw = e. 

Exercise 2.3-4: Show that CPT invariance implies ^^^{TnT{I)\Hyj\K°)in = —Aje^^E 
Exercise 2.3-5: Show that for s-wave states 

( 2 . 8 ) 

(2.9) 


^(Itt+tt )-t|7r 7r+)) = ^|7r7r(/= 2))-h Y'||7r7r(/= 0)) 

= 2)) - = 0)) 


Combining we find 


V+- = 


^o)(^Iniyl2-|-eReyl2)-|-\/2eylo 
eh52-<5o)(Re^2 + feReyl2) + V^Aq ' 


( 2 . 10 ) 


These can be further simplihed using empirical information (data) and approxi¬ 
mations. The “A/ = 1/2 rule” is the observation that |yl 2 |/^o ~ 1/20. Using the 
approximations |^ 2 |/^o ^ 1 and |e| <C 1 above, and writing t]^ _= e-|-e' we obtain 


Im^2 

V2Ao 


,i{S2—So) 


( 2 . 11 ) 
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Moreover, r/oo = e — 2e' so that 


( 2 . 12 ) 

Voo e 

Experimentally it is found that ReeYe = (1.66 ± 0.23) x 10“^, so that indeed the 
approximations made are valid and CP-violation in the decay amplitudes is small. 

Incidentally, we can now show we were justified in assuming Imri 2 <C Reri 2 . 
From Eq. (2.5) 


ri2 = {2tt)Y^ 5{M - En){K^\H^\n) {n\H^\K^). 

n 


Note that since Br(Rr 5 —>• vrvr) is nearly 100%, E is dominated by Trvr intermediate 
states. Since Aq |^ 2 |, P is dominated by 7r7r(/ = 0). In our convention Im^o = 0 
and therefore Imri 2 does not get a contribution from 7r7r(/ = 0). 

2.4 Sketch of SM accounting for e and e'/e. 

2.4.1 e. 

We would like to determine the SM prediction for e. Rather than giving a full 
computation from first principles, we will use the semi-empirical formula (2.6). 
You may wonder, is this a cheat, since we used data, e.g., AP ~ 2AM, to derive 
this formula? The point is that the emphasis is in accounting for CPV: 

(i) In principle we do not need to use empirical input. 

(ii) It is useful already this way. For example, as we will see this is often sufficient 
in constraining NP that enters only through short distance effects. 

So our task is to estimate ImMi 2 . From (2.4), 

Mi2 = {K^\H^\K^) + + ■ ■ ■ (2.13) 

^ rriKO - En 

n 

We use this below. Beware this still has the peculiar non-relativistic normalization 
of states. For the first term this is easy: replace the hamiltonian density TLw for 
the Hamiltonian and let the states be normalized a la Bjorken and Drell, as 
{p'\p) = — p'). One better: then divide by 2mx so the states are now 

relativistically normalized, {p'\p) = 2E5^^\p — p'). 
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Short Distance vs long distance contributions. So what is = f d^xTiy,! 
We could simply use Tiw = g 2 W^J^ + h.c., where is the charged hadronic 
(quark) current (no need to include the neutral current coupling to because it 
does not change flavor). But then we would need to go to high orders in the above 
formula. Diagrammatically the AS* = 2 process is from (to lowest order): 


s u,c,t d 



d u,c,t s 


(2.14) 


This is fourth order in Hyj. But since rriK 'C Mw we expect that Fermi theory, 
in which we replace the VF-propagator mediated interaction by a local 4-fermion 
vertex, is a good approximation, i.e. 


s ► s ► u 
W 

d < u 



(2.15) 


That is, when the momentum through the W propagator is negligible compared to 

Myy 






q^-Ml 


w 


-I^VusUYPlS 


q^O 




This corresponds to an effective Hamiltonian density^ 


n 


AS=1 


92 


ULlf^SL, 


and similarly for other terms (with varying external quarks). 

One advantage of using this effective Hamiltonian^ is that for the AS = 2 
transitions, the formulae for M 12 and ri 2 , we first get a contribution at second 
order, rather than fourth, in the expansion (that is, order G'j^). However 

^Note: since the coefficient of the effective Hamiltonian density is proportional 

to l/f^. This is also Fermi’s constant. More precisely, Gf is defined by Hw = GfIV^CV — A) ® 
(V — A), where V — A stands for a “vector minus axial” current, as in — ysfd. Hence 

Gf~ l/(2ii^), and the well known value Gf = 1-17 x 10“®GeV“^ gives the value of the VEV 
in EW theory, v = 1/V2Gf = 246 GeV. 

^We get tired of saying “effective Hamiltonian density” so it is standard practice to omit 
“density.” 
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(i) the evaluation of the second order term, the one with the PP in (2.13), is 
cumbersome, and 

(ii) we have not exploited the fact that rriK ^ mt- 

In fact, we could also consider the charm quark as so heavy that an approxima¬ 
tion based on mx ^ is useful. All these comments are related: if we can 
approximate 



then we can we can insert the local vertex into the hrst order term in the expression 
for Mi 2 , Eq. (2.13). We call these “short distance contributions.” This is because 
At ~ h/NE ~ h/{Mx — mK), for X = the least of Mw, rUc and mt, is much smaller 
than any time scale associated with the dynamics of strong interactions. This is, of 
course, much better for the case of top-quarks than for charm-quarks, but even for 
these, the approximation is pretty good and particles with charm cannot appear 
as on-shell intermediate states in the sum in (2.13). 

So we have split the calculation into two pieces, a short distance contribu¬ 
tion, evaluated from the hrst order term in (2.13) and containing the diagrams 
where both internal quarks are heavy (c or t), and a “long distance contribution,” 
evaluated from the sum over states in (2.13). The latter can involve states that 
propagate over long times when their energy is close to mx (near the energy pole), 
e.g., for |n) = Itt-tt), Itttttt), ... 

Long distance contributions are difficult to compute. But generally contribu¬ 
tions from NP to the long distance terms are negligible. So we can happily extract 
the long distance contributions from data (using AM ~ 2ReMi2) and concentrate 
on computing the short distance contributions, where NP may more readily show 
up. Note that this is consistent with the semi-empirical approach adopted above in 
writing formulae for e and e'/e. Moreover, in some cases the long distance contri¬ 
butions can be calculated through Monte Carlo simulations of QCD on the lattice.^ 
So from here on we concentrate on the short distance contribution. Roughly, 


/ s u,c,t d ^ 



ImMi 2 ~ Im 


r\j 
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Im 


G'pM^ 

47r^ 




{K^\dL7^SLdL7^,SL\K'') 


Here / is a dimensionless function that is computed from a Feynman integral of the 
box diagram and depends on M\y implicitly. Note that the diagram has a double 
GIM, one per quark line. In the second line above, the non-zero imaginary part is 
from the phase in the KM-matrix. In the standard parametrization Vud and Vus 
are real, so we need at least one heavy quark in the Feynman diagram to get a 
non-zero imaginary part. I will explain later why the diagram with one u quark 
and one heavy, c or t, quark is suppressed. We are left with c and t contributions 
only. Notice also that KM-unitarity gives ^qdYqs — 0; since lmy*^V^g = 0, 
we have a single common coefficient, = —ImV^^V^j, = A'^X^rj in terms of 

the Wolfenstein parametrization. For later use we define Xq = and as we 

just saw ImAc = —ImA*. We will also need ReAc ~ —A and ReAi « —A^X^{1 — p). 

The matrix element of the four-quark operator between kaon states requires 
understanding of non-trivial hadrodynamics. We parametrize our ignorance as 
follows, 

{K^\dLl^SL dLl^.SL\K'') = IIWkBk (2.16) 


where we have used the standard, relativistic normalization of states. Here the 
mass rriK and the decay constant fx are known data, so Bx is the dimensionless 
parameter characterizing the value of the matrix element. Monte Carlo simulations 
of QCD on the lattice suggest Bx ~ 0.77 [14]. The choice of parametrization may 
seem peculiar. It is motivated by the following exercise: 


Exercises 

Exercise 2.4.Tl: Bx = 1 in the “vacuum insertion approximation.” This consists 
of summing over all possible insertions of the vacuum, |0)(0|, as an intermediate 
state in the matrix element above, including Fierz rearrangements. For N^. colors 
show that in the vacuum insertion approximation one has {K^\dLj^SL ) = 

i(2 + 2/N,)flml. You will need (0|dV75s|R°(p)) = fxpC 

Exercise 2.4-1-2: Compute the partial width for 77+ —)■ e+iz in terms of fx- This 
explains why fx is called the “kaon decay constant.” 

Assembling all these factors we get 


ImMi2 = 


2mx 

X [A^A®(1 - p)f{mt, mt) - X{f{mc, rric) - firuc, mt))] 


We have included a factor of lj2mx to revert to relativistic normalization of states. 
All that is left to do is to compute the function f(x,y) by performing a Feynman 
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integral. This takes some work, or you can find it in the literature [15], so I 
will only display the result, given in terms of the ratios Xg = rrig/M^ using the 
approximation Xc 1: 


f{mc,mt) 


= Xc 


Ax 


^ Xc 2>xt Sxflnxi 

^ xt 4(1-xt) 4(1-Xi) 

i —llx^ + xf 3xflnxt 

4(l-xt)2 2(1-Xi)3 


2 


Our hnal result is (modulo short distance QCD corrections which we will return 
to below) is 

e « C^I^Cc-^y^T] [A^X^{1 - p)f{mt, mt) - A(/(mc, me) - /(me, mt))] (2.17) 


where the fixed, uninteresting, data-driven constant is 


Gv^vr^AM/i' 


3 X 10"^. 


Let’s check the order of magnitude of the contributions of the various terms. The 
overall coefficient includes a A^X^r] ~ (0.2)® ~ 3 x 10“^, and the terms in the 
square bracket in (2.17) give 


A^X^il - p)f{mt,mt) ~ 
Xf{mc,mt) ~ Xf{mc,mc) 


(0.2)® ~ 3 X 10"^ ^ e ~ 3 X 10"® 
~ (0.2) ~ 10 e ~ 10 ® 


The result is quite remarkable: all the terms give comparable contributions to e, 
and all of them are of the correct order of magnitude! 


Exercises 

Exercise 2.4-1-3: (i) Pretend you can compute ReMi 2 by computing Feynman di¬ 
agrams and therefore using • • ){sLAdL){sL'lfj.dL), so as 

to ignore the cumbersome Estimate AMx- Compare with the experi¬ 

mental value. 

(ii) How does this change if you were to ignore c, t quarks (so there is no GIM 
mechanism)? 

(iii) Now pretend there are only two generations (ignore the b and t quarks). How 
large does rric have to be to account for AMjf? Historically this computation 
is very important: it led to the prediction of the existence of the charm quark 
and of its mass, and it is how the GIM mechanism was discovered [17]. 
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Short distance QCD corrections: a precap. Before we move on: it may 
bother you that we made the replacement 






We have expanded in powers of Gp (perturbation theory in the weak interactions) 
but kept strong interactions exact: states are due to strong interactions and the 
computation of matrix elements is in the strongly interacting theory. But we 
understand strong interactions as described by QCD and we can easily see we have 
left some contributions out, for example: 



Notice however that graphs with gluons connecting external quarks are ac¬ 
counted for already, since they are included in the computation of states and matrix 
elements, e.g., 



^Precap is defined in www.urbandictionary.com as: Annoying pre-commercial preview of 
what’s to come after the commercial break on the program you are already in the midst of 
watching. 
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W 


We will later return to this question and explain how terms like those in (2.18) 
give contributions that are perturbatively computable, of the form ^ Innig/rrix 
or as/lTrlnMiy/mK and we will find out how to use the renormalization group 
to sum the leading logs, he., the contributions from multi-loop diagrams that 
contain (as/47rlnmg//i)”. This will also tell us at what scale to compute as{fi), 
hence removing this uncertainty. Numerically the effect is to modify our result, 
Eq. (2.17), by some short distance correction factors as follows: 

e « U^/^CeA'^X^r] [r] 2 A^X^{l - p)f{mt,mt) - X{r]if{mc,mc) - mf{mc,mt))] 

(2.19) 

with r/ 1 , 2,3 ~ 1.4, 0.6, 0.5 [26, 27]. 




2.4.2 Direct CPV: e'/e 

I’ll be brief. Most concepts have been introduced. I will concentrate on new 
features. 

Recall, from Eq. (2.11) 

Ttti 

e' K, i ^ gh'^2-<5o) (Im^o = 0 basis). 
v/2^0 

We must look ai K ^ vrvr, that is, at s —)■ duu. We will need to include underlying 
transitions at the loop level, else we will not obtain any CPV. At tree level 
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Digression. This one loop contribution is called a “penguin” diagram. I do 
not know why. I have heard many stories. It was certainly first introduced in the 
context we are studying. Here is a penguin-like depiction of the diagram: 



Figure 2.1: Penguin Feynman diagram. 


End digression. 

Now since gluons have / = 0, the only isospin change is through the VF-loop 
in the figure, transmuting an s-quark to a w-quark. Hence, penguin diagrams give 
AI = 1/2 but no AI = 3/2. Therefore, importantly 

(a) Penguins may play a role in understanding the origin of the AI = 1/2 rule, 

(b) The computation will lead to a phase only in iF —)• 7 r 7 r(I = 0). But we chose 
ImHo = 0 by re-phasing (re-defining the state by a phase) K^. So before 
re-phasing compute the phase of Hg, say Then rotate K^, Aq = Hge"*^ 
and A 2 = ^ 26 “*^ so that Aq is real. 


Then 


e 


/ 


\/2Ao 


—i sin ^ 


1^2! 

^/2Ho 


pi-KjA 

- - £ 

20\/2 


( 2 . 20 ) 


where I have used ^ <C 1, the AI = 1/2 rule and the empirical value of 82 — 5o. 

Estimating e' je in the SM is more difficult than computing e. The reason is 
a combination of two facts. First, for e' one needs to compute decay amplitudes. 
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which involve three particles, one in the initial and two in the final states, as 
opposed to for e which requires amplitudes with two particles in the matrix element 
for mixing. The second is that there are several competing contributions and 
there are some delicate numerical cancellations. I will therefore only say a few 
words and refer the interested student to the slightly more complete treatment 
in the textbook in Ref. [16]. As we have seen the penguin diagram gives only a 
contribution to Aq. However, if in the penguin diagram we replace a photon or a 
Z for the gluon the resulting graph gives also a contribution to A 2 . The reason is 
that neither the photon nor the Z couplings respect isospin, they transform as a 
combination of I = 0 and 1 = 1. These digrams are called “electroweak penguins” 
to distinguish them from the plain vanilla penguins (sometimes called “strong” 
penguins, to emphasize the distinction). Why bother? After all these digrams are 
suppressed by ai, 2 /a 3 relative the strong penguin. The point is that the AI =1/2 
rule acts to amplify the direct contributions to a phase in A 2 . More specifically, 
the phase ^ in (2.20) gets a contribution from both AI = 1/2 and AI = 3/2 
amplitudes, 

ImAo ImA2 
ReAg ReA2 

Furthermore, since there are cancellations, it turns out the effects of isospin break¬ 
ing by quark masses are non-negligible and have to be included. A full account is 
beyond the scope of these lectures. 

2.5 Time Evolution in mixing. 

We have looked at processes involving the ‘physical’ states and Ks- As these 
are eigenvectors of H their time evolution is quite simple 

ij^\KL,s) = {Ml,s - ITl,s)\Kl,s) ^ \KL,s{t)) = 

Since \Ki^s) eigenvectors of H, they do not mix as they evolve. But often one 
creates or in the lab. These, of course, mix with each other since they are 
linear combinations of Kl and Ks- 

We’ll analyze this in some generality so we may apply the results to D^, 
and Bg as well. The two mesons system X^-X has effective ‘hamiltonian’ 

H = M-|r, Mt = M, rt = r, 

and the physical eigenstates are labeled conventionally as Heavy and Light: 


\Xh) = p\X^) + q\X^), \Xl) = p\X^) - q\X^) 


(2.21) 
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As before, 


where 


p ^ ^ Mi2-|ri2 ^ 1 AM - |Ar 
q 'am - iAr 2 M *2 - ir *2 


Mh - = M - |r ± i(AM - iAr) 

The time evolution of is as above, 


Now we can invert, 


|AH(t)) = e '^^l'\Xh{0 )). 

L L 


X^) = ^^{\Xh) + \Xl)). 


'X'^) = i-(\XH)-\XL)). 


2q 


Hence, 


2p L 


and using (2.21) for the states at t = 0 we obtain 


^0, 


|A^t))=/+(t)|A^) + 2/_(t)|A 


where 


f±{t) = i 






= 2^ 




Similarly, 


= 2^ 


'x\t)) = ^j.{t)\x^) + U{t)\x 


1 ± 






( 2 . 22 ) 


(2.23) 


(2.24) 


Example: Time dependent asymmetry in semileptonic K decay (“AT^s 
decay”). This is the time dependent analogue of 5 above. The experimental 
set-up is as follows: 

monochromatic beam of and 


p beam 


I “n 
target 


‘magic box” 


\ \ 


e TT ' z/ e ' TT u 


detector array 
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Figure 2.2: Charge asymmetry in semi-leptonic neutral kaon decays, from an 
experiment by Gjesdal et al, [28]. The solid curve is a ht to the formula (2.25) 
from which the parameters F^, AM, a and Re(e) are extracted. 


The proton beam hits a target, and the magic box produces a clean monochro¬ 
matic beam of neutral K mesons. These decay in flight and the semileptonic decays 
are registered in the detector array. Assume there are N^ and Nj^o and K ’s 
from the beam, respectively. Measure 


m 


N+ -N_ 
N+ + N- 


as a function of distance from the beam (which can be translated into time from 
production at the magic box). Here N^ refers to the total number of events 
observed with charge ± lepton. In reality “vr^” really stands for “hadronic stuff” 
since only the electrons are detected. We have then, 


Nko [TiK^it) 7r“e+z/) - r(A°(t) ^ 7r+e“i7)] 


m = 


+ Nj^o 


T{K (t) —)• TT e'^u) — T(K (t) —)• 7r'’'e v) 


Nko [r(A:°(t) ^ 7r“e+i/) + T{K^{t) 7r+e“i^)] 


+ Nj^o 


r{K (t) —)• TT e'^u) -|- T(K (t) —)• 7r'’'e v) 


Exercises 
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Exercise 2.5-1: Use T{K^{t) —>■ tt e'^u) oc |(7r e'^r'\Hw\K°{t))\'^ and the assump¬ 
tions that 

(i) {Tr~e~^i^\Hw\K°(t)) = 0 = {TT~^e~i'\Hw\K^{t)) 

(ii) {TT~e~^v\Hw\K^{t)) = {Tr~^e~vlHwlK^(t)) 
to show that 


m 


o 

1 

o 

l/+(t)p-l/(t)pi(^ 

1 

p 

(Nko + Nj^o) 

l/+(t)p + l/_(t)pl(^ 

1 

p 


I{Nko + Nj.o)\f_{t)\ 


-i(7V^o-7V c)|/_(t)|2 



Justify assumptions (i) and (ii). 

The formula in the exercise is valid for any X^-X^ system. We can simplify 
further for kaons, using pjq = (l-|-e)/(l — e), a = {Nj^o — Nj^o)/ {Nj^o + N^^o) and 
AT Rs —r 5 . Then 


d{t) 


“ [l/+(i)P - l/-(i)P] + 4Re(e)|/_(t)p 
[|/+(t)|2 + |/_(t)|2]-4aRe(6)|/_(t)|2 ^ 

2 oe“ 2 ^s* cos(AMt) + (l + — 2 e“ 2 '"s'* cos(AMf))2 (l -|- |) Re(e) 

1 -h e-'^A 


(2.25) 


Figure 2.2 shows the experimental measurement of the asymmetry [28]. The 
solid curve is a fit to the formula (2.25) from which the parameters r 5 , AM, a and 
Re(e) are extracted. The fit to this figure gives AMk = (0.5287T0.0040) x 10^° s“^. 
The current value, from the PDG is AMk = (0.5293 ± 0.0009) x lO^'^s"^. 


2.6 CP-Asymmetries: Interference of Mixing and De¬ 

cay 

Very generically a CP-asymmetry a is defined as 

r(process) — r(CP conjugate process) _ T — T 
r(process) -|- r(CP conjugate process) T -|- T 

Under what conditions can this be non-zero? T ~ |(out|in)p and if (outjin) has 

definite transformation properties under CP, e.g., (outjin) -)> ±(outjin), then 

CP 

r = r and a = 0. We can get a non-zero asymmetry from interference between 
two amplitudes with opposite, definite CP properties, one even and one odd under 
CP: (out I in) = (out|in)_|_ -|- (out] in) _ —)• (out] in) + — (out|in)_. Then 

|(out|in)+ -h (out|in)_p — |(out|in)+ — (out|in)_p 2Re((out|in) + (out|in)l) 
|(out|in)+ -h (out|in)_P -|- |(out|in)+ — (out|in)_p |(out|in) + |2 -|- |(out|in)_P 


■tS IkQ ■tS 1*0 
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One way to get an interference is to have two “paths” from |in) to |out). For 
example, consider an asymmetry constructed from F = F(X^ —)• /) and F = 
F(X —)• /), where / stands for some final state and / its CP conjugate. Then F 
may get contributions either from a direct decay —)■ / or it may first oscillate 
into X and then decay X —)■ /. Note that this requires that both X^ and its 
antiparticle, X , decay to the same common state. Similarly for F we may get 
contributions from both X —)■ / and the oscillation of X into X^ followed by a 
decay into /. In pictures. 





Concretely, 


F(XO(t) ^ /) oc \U{t){f\H^\X^) + f_{ty-{f\H^\X^)f 

^\U{t)Af + f.{tr-Af\^ 

F(X°(t) ^ /) oc |/_(t)2(/|i/^|X0) + Uit){f\H^\xY 
= lf/-(^)^7 +/+(^)^7p 

I hope the notation, which is pretty standard, is not just self-explanatory, but fairly 
explicit. The bar over an amplitude A refers to the decaying state being X , while 
the decay product is explicitly given by the subscript, e.g., Aj = {f\Hw\X ). 

Exercises 

Exercise 2.6-2: If / is an eigenstate of the strong interactions, show that CPT implies 
\ Af\^ = \Aj\^ and \Aj\^ = \Af\^ 

The time dependent asymmetry is 

.. ^ F(X0(t)^/)-F(X°(t)^/) 

F(X0(f)^/) + F(X°(f)^/) 


and the time integrated asymmetry is 
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where —)•/) = dtT{X^{t) —)• /), and likewise for the CP conjugate. 

These are the generalizations of the quantities we called 5 and 5{t) we studied for 
kaons. 

For the rest of this section we will make the approximation that AF is negligible. 
For the case of for example, AF/F ~ 10“^, while for Bg the ratio is about 10%. 
This simplifies matters because in this approximation 


f±{t) 




cos(^AMf) 

—i sin(^AMt) 


If one further approximates \q/p\ = 1, which is the analog of Re(e) <C 1 for K^, 
then one finds (see exercises below): 


a = 



(pj - 

+ ^Im(p2+pE) 



(2.26) 


where^ 



Aj 

and p = . 

Y 


This formula will be our workhorse. I will leave it up to the student, guided by 
specific exercises at the end of the section, to go through the same analysis in the 
time dependent case. In fact, the time dependent case is simpler to analyze® but 
the central observations are the same in both time dependent and time independent 
asymmetries. We consider several special cases. 


Case I: Self-conjugate final state. Assume / = ±/. Such self-conjugate states 
are easy to come by. For example D^D~ or, to good approximation, J/ipRs- Now, 
in this case we have A^ = YAf and Aj = YAf, so that p = l/p. Since these final 
states are eigenstates of the strong interactions, using the result of exercise 2.6-2 
we have \p\ = \p\ and therefore \p\ = 1. We already assumed \q/p\ = 1, so we have 
ImpS = —Imp2, So one has 


a = 


AM 

r 


l + (^)' 


Im I p- 

p 


®It is unfortunate that the standard notation for AfjAf uses the same symbol, p, as the 
parameter of the Wolfenstein parametrization of the CKM matrix elements. It will hopefully be 
clear from the context which one we are referring to. 

®So the choice of presentation must seem non-pedagogical, but I did want to have the student 
have the opportunity to work out the time dependent case. 
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Here is what is amazing about this formula, for which Bigi and Sanda [18] were 
awarded the Sakurai Prize for Theoretical Particle Physics: the pre-factor (the 
stuff multiplying the “Im” part), depending only on AM/P can be determined 
from independent measurements (mixing and lifetime), and then what is left is 
independent of non-computable, non-perturbative corrections. The point is that 
what most often frustrates us in extracting fundamental parameters from experi¬ 
ment is our inability to calculate, that is, make a prediction that depends on the 
parameter to be measured. I now explain this claim. 

The leading contributions to the processes —)• / and B —)• / in the case 

/ = d+d- are shown in the following figures: 




Either using CPT or noting that as far as the strong interactions are concerned 
the two diagrams are identical, we have 

Since \p\ = 1 we know it is a pure phase, and we see that the phase is given purely 
in terms of KM elements. 

To complete the argument we need q/p. To this end we analyze mixing in the 
case of B^ mesons. Now, the diagram that gives mixing is just like in the neutral 
kaon system: 


b u,c,t d 



d u,c,t ^ 


and just like in the K^-K^ case it involves a factor Ylq q/ ^qbYd ^q'b^q'd 
But now IT), 14^ are sides of a fat triangle so that f{mt,mt) dominates the contri¬ 
butions of the virtual quarks. Therefore M 12 is short distance dominated, ri 2 is 
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negligible (explaining why AT is negligible), and the phase in M 12 is dominantly 
from Mi 2 oc Neglecting ri 2 we have then 

p ^ 2^ ^ ^ ^ ^ 

q AM 2M*2 IM 12 I 





Figure 2.3: Experimental evidence for time dependent CP-asymmetry in 6 —)• ccs 
decays. The left pannel shows results from BaBar for B^{B ) —)■ D^D~ [22] while 
the right pannel is Belle’s B'^{B^) —)• 'il^{2S)Ks [23]. 


Collecting results we have 


Im 



= Im 


y^yu 

y^ycd ytiXu 


= Im(e2*^) 


sin(2/3) 


and the asymmetry is o = [(AM/r)/(l + (AM/P)^)] sin(2/3). Measurements of the 
asymmetry and the mixing parameters give (twice the sine of) one of the angles of 
the unitarity triangle without hadronic uncertainties! 


Exercises 

Exercise 2.6-3: Calculate the time integrated asymmetry without assuming \q/p\ = 
1. Then take \q/p\ —>■ 1 to verify (2.26). 

Exercise 2.6-4-' With the same assumptions as in the analysis above for the time- 
integrated asymmetry (jpj = \q/p\ = 1), show that the time-dependent asymmetry in 
is a = sin(AMt) sin(2/3). 
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Exercise 2.6-5: Above we assumed that 5° —>■ D^D (and ^ D'^D ) is domi¬ 
nated by a single diagram, and from this we showed that p is a pure CKM phase. 

(i) Exhibit other diagrams (other topologies) that contribute to this decay that 
have different CKM matrix dependence than this leading, “tree level” one. 

(ii) Show one can always write A = e®“T -b where a and /3 are weak phases 
(NOT necessarily the phases in the unitarity triangle) and T and P are the 
rest, including the magnitude of the CKM and the strong interaction matrix 
elements. Moreover, show that PA = e“*“T -b 

(iii) Assuming \P/T\ <C 1 show that now 

a{t) = cos{AMt)C + sin{AMt)S (2.27) 

where C = 0{\P/T\) and S = sin(2/3) -b 0{\P/T\). Exhibit these relations 
explicitly. 

(iv) More generally, for any / = ±/ write A/ = ^pf with \q/p\ = 1 to show that 

1- |A/P 2ImA/ 

l + |Ay|2’ l + |A^p 


Take a look at Fig. 2.3 where the experimental measurement of the rates for —)■ 

D'^D~ [22] and B^{B^) —>■ 'tjj{2S)Ks [23] display a clearly visible asymmetry, and 
Eq. (2.27) is a good fit to the time dependent asymmetry. Nice! 

Exercise 2.6-6: The most celebrated case is i? —> J/ij^Ks. Here are the leading 
diagrams: 






(i) Show that other {e.g., “penguin”) contributions are loop plus CKM suppressed. 

(ii) Neglecting those corrections determine a and a{t). Be sure to include the q/p 
factors for the component of K^{K ) in ATg. 

Exercise 2.6-7: Another case of much interest is Bd —>■ tt+tt” (and its cousin, Bd —>■ 

Tr°TT°). 

(i) Display the diagrams that contribute to these decays. 

(ii) Assuming tree level dominance, what angle of the unitarity triangle determines 
the decay asymmetry? 
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(iii) Argue that in this case, however, tree level dominance is questionable at best. 

Once it was realized that the tree diagrams are not dominant [19, 20], a method was 
proposed that combines several decays using isospin symmetry, for a clean determi¬ 
nation of an angle of the unitarity triangle [21]. 

Exercise 2.6-8: We’ve discussed AT® and mixing within the SM, but not Bg nor 
£»o. For Bg we need replace Vqg for Vqd everywhere. To see that you understand this 
show that the CP asymmetry for Bg J/4’4> (to good approximation ^ is a pure ss 
state) is no more than a few per-cent in the SM. 



Chapter 3 


Effective Field Theory 

3.1 Introduction 

Recall that we left open the question of how to include 


9 



in X^-X^ mixing, since we computed in terms of the matrix element {X^\0\X^) 
of a local operator O. In fact, we never quite justified properly why we can use a 
local operator instead of a time ordered product of interactions in the electro-weak 
Lagrangian. We also mentioned a related problem, how to deal with the scale 
uncertainty, that is, how to choose between, say, as{Myi/) and asiniK)- We’ll 
address these problems in this chapter. We will get into the guts of how it all 
works; the price we’ll pay is limited time for explicit examples. I think it is the 
correct emphasis. 

The scale uncertainty problem derives form having disparate scales. The tech¬ 
nique we’ll utilize to address this is the effective field theory (EFT). It allows one 
to look at the physics of the shortest distance/time scales ignoring the longer ones, 
and then move sequentially to longer distances/times. 

The problems we are facing are artifacts of perturbation theory. For example, 
if we could compute non-perturbatively, or at least perturbatively to all orders, 
we would use as(/^) for the coupling, with an arbitrary renormalization scale fi, 
together with \u^d{ii), 51 , 2 (m)- Then the physical amplitudes would actually be 
;U-independent. Of course this is the content of the renormalization group equation 
(RGE), which we will use extensively. 
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There is a related point worth mentioning. Disparate scales often result in 
possible breakdown of perturbation theory. The best example is in grand unified 
theories (GUTs) for which Mgut can be enormously larger that the electroweak 
scale, of order (with v = 246 GeV). If you compute, say, e"'“e“ —)■ ijT 

at a CM energy of the order of current accelerators in a GUT in terms of its sole 
coupling constant, ^gut, you’ll find to 1-loop that 

.4 = .4bo„(i+c2S21i„1%t+...) 

\ TT J 

Here c = some number of 0(1) and I have omitted terms that do not contain 

the large enhancement factor In—~ 30. Now ctgut ~ 1/40 is fairly typical 
and c can easily be of order vr, if not for this process for some of the great many 

low energy processes in the PDG book. Not only is the 1-loop correction large, 

m2 

0 (100%), but at n-loops there will be a correction of order (^^/^ In 

If you can account for all of the terms of the form (^^^)^ln— t^)”, say by 
summing the corresponding Y2n Cn(^^/^ In —then the next order gives cor¬ 
rections of the form c ' f qgut -^gut \n^ j£ qqut ^gut ^ £ then these 

subleading corrections are of order ~ 1 /ln—~ 1/30. Nice. All we need 

to do to get per-cent accuracy is to sum those “leading-logs.” But failing to do so 
we incur in 100% errors. 

The EFT technique takes advantage of the simpler form of the RGE when there 
is only one relevant scale (one at a time!) in the problem, to sum the leading-logs 

A/f2 

(LL) and if needed the next to leading-logs (NLL), i.e., 

3.2 Intuitive EFT 

We can get a good preview of what EFT is about from general considerations. 
Suppose you have some particles whose interactions you are studying in a particle 
accelerator with GM energy E. Obviously all these particles, including the colliding 
particles and those produced in the collisions, have masses less than E. Let’s call 
these particles “light.” Suppose further that you know of the existence of a particle 
that interacts with the light particles, but has mass M much larger than E] it’s 
a “heavy” particle. You also know that the interactions among all these particles 
are well described by a local, Lorentz invariant, renormalizable QFT. We call this 
the “full” theory. 

While you could take the full theory to calculate reaction rates among the light 
particles, using the full QFT including the heavy particle may strike you as overkill. 
Since the heavy particle cannot be used in the collider, nor can it be produced by 
the collisions, can we just ignore its presence? That is, delete it from the theory? 
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Clearly the heavy particle can affect collision rates among light particles through 
its virtual effects. The space-time separation between the point at which the vir¬ 
tual heavy particle is created and the one at which it is annihilated is of order 
1/M. To be sure, the exchange of the heavy particle produces what appears as a 
non-local interaction among the light particles. But the scale of the non-locality is 
very short, of order 1/M, and we can Taylor expand in this parameter so that on 
distance scales L ^ 1/M, the interaction appears local: the expansion is in powers 
of 1/(ML). So if we are prepared to say that such a separation is undetectable 
in low energy collisions we can model the effects of the virtual exchange by local 
operators. Some of the local operators are of dimension 4 and would come into the 
Lagrangian with dimensionless coefficients. Some of these may already exist in the 
Lagrangian of our full QFT. In fact, if the full Lagrangian contained all possible 
dimension 4 operators there are no new operators of this dimension to add. But 
are the coefficients modihed, and if so, how significantly? 

By dimensional analysis the effects of the particle of mass M should come into 
scattering amplitudes only through the logarithm of this mass, say ln(M//i), where 
^ is a renormalization scale. Powers of E/M are also allowed, but powers of E 
translate into powers of momenta, hence derivatives on local operators, and this 
means higher dimension operators. So the Lagrangian is modified by allowing the 
dimensionless coefficients of dimension 4 operators to be functions of ln(M/;u). 
Even if we do not know these functions, we can model the low energy interactions 
by writing the most general renormalizable Lagrangian for interactions among the 
light particles. Knowledge of the full QFT, including the heavy particle, simply 
gives us additional constraints on the dimensionless couplings of this Lagrangian. 
This is fine provided we do not need accuracy better than E/M in our predictions 
of scattering amplitudes. 

Sometimes, however, there are processes that are simply not allowed by the 
renormalizable Lagrangian of the light particles, but are allowed if we include op¬ 
erators of dimension higher than 4. This can happen also as a result of a virtual 
exchange of a heavy particle. The effect is captured in the low energy Lagrangian 
by including higher dimension operators (that is, of dimension higher than 4) with 
dimensional coefficients made up with inverse powers of M. Hence, our “effective” 
Lagrangian for interactions among the light particles is the most general renor¬ 
malizable Lagrangian for the light particles, consistent with Lorentz and gauge 
symmetries, supplemented by higher dimensional operators accompanied by in¬ 
verse powers of M. 

There is one more insight that is worth dwelling on. Suppose you want to 
compute the scattering amplitude for some light particles to accuracy E/M. To 
make our discussion simpler let’s assume that the scattering amplitude vanishes 
identically if only renormalizable interactions are retained in your effective theory. 
That is, it vanishes in order (E/M)^. That is, it vanishes as M —)■ oo. Then to get 
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a non-zero amplitude you must use the interaction terms that include one power 
of 1/M. And any one of these interactions may come in only once, else you will 
get more powers of 1/M. Think about it this way: you do first order perturba¬ 
tion theory in these interactions, but compute to all orders in the renormalizable 
interactions. Then there are many-loop contributions that probe the 1/M vertex 
at arbitrarily short distances. It would seem we have a problem, since the premise 
of our construction is that we could expand the non-local interactions on long dis¬ 
tances L in powers of 1/(ML). And here is the crux of the matter. In these loops, 
when the virtual (light) particles probe long distances, the approximations we have 
made work well by design. And while the approximation fails when the virtual par¬ 
ticles probe short distances, the failure can be modeled by a local interaction. But 
we already have all possible local interactions in our effective Lagrangian. All we 
need to do is to choose their interaction strengths (a.k.a. operator coefficients, or 
coupling constants) appropriately so as to reproduce the scattering amplitude of 
the full theory. 

In fact, you can go one step further. In the absence of detailed knowledge of 
the full QFT, we have argued on general grounds that the physics of the light 
particles is described by a non-renormalizable Lagrangian, the most general sum 
of local operators consistent with symmetries with the coefficients of operators of 
dimension 4 -|- n including a factor of 1 /M”. This should approximate extremely 
well any amplitude provided the energies involved are small compared to M. The 
mass scale M acts, in effect, as an energy/momentum cut-off.^ 

Let’s make these ideas somewhat more precise. 


3.3 The Appelquist-Carazzone Decoupling Theorem 


(Note: there is no wikipedia page for this: here is your chance to make a mark!) 
While we call this a ‘theorem’ neither them nor I are mathematicians, so do not 
expect a ‘proof.’ Instead we will explain what it is and how it works [24]. Consider 
a model with a renormalizable Lagrangian 


-^full — -flight + 5 + T(/,_light 


>Ciight may include many fields but all with mass <C M. It depends on coupling 
constants, which we take to be dimensionless parameters, gt, and possibly also 
masses rrij. T^-Ught has the interactions between (j) and the light-fields and depends 
on Qi and possibly on additional couplings hi. 

^The alert reader will notice that if we apply this prescription to operators of dimension less 
than 4 we will render the light particles as heavy as M, and the whole procedure meaningless. 
This is the hierarchy problem, that the Higgs mass term ought to be as large as the scale of the 
cut-off. 
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Consider Green functions ... ,pn) (or, better yet, amplitudes) of the 

light fields (associated with light particles) restricted to \pi\ <C M (all components 
of all momenta). Then 

G(-)(pi,... ,pn) = ztGW(pi,... ,p0(1 + Oil/M)) 

where G^'^\pi,... ,pn) is computed from £efr = flight, where Tught is a renormal- 
izable Lagrangian constructed out of the helds in Tught, with new (possibly more) 
couplings gi. The gi = gi{{gi},M) and Z = Z{{gi},M) are not functions of mo¬ 
menta, only of the indicated arguments, and are universal. That is, the same g 
and Z appear in any Green function. To be sure, there is a different Z for each 
light field, and the factor Z 2 really stands for a Z 2 for each field in the Green 
function, but as we will see shortly this is of little consequence (so I will continue 
to be sloppy in the way the factor is included). 

The meaning is clear, heavy particles appear in only through virtual effects, 
by construction. At large M (M » m, |p|) the effects of M decrease as powers of 
1/M, except possibly when M appears in logarithms. The content of the decoupling 
theorem is that (i) there are no positive powers of M, and (ii) the ln(M) terms can 
all be absorbed into g and Z. 

For the theorem to work you have to be able to take M arbitrarily large holding 
gi constant. It fails when M = gv, the case where the field (p and some of the 
light fields acquire their mass solely from spontaneous symmetry breaking, because 
either n —)• 00 and all particles get heavy, 01 gi ^ 00 together with M, so that the 
order 1/M corrections can go as g/M = 1/v = fixed. 

Concretely, lets consider GUTs. Say SU{5). We can apply decoupling to the 
McuT-heavy fields. Then by construction Ces is just the SM with couplings 51,2,3 = 
5 i, 2,3(5 gut, Mqut)- To understand what is going on consider again e'''e“ —)■ 



The effect of the loop of GUT-heavy particles differentiates the photon and Z from 
gluons by differences in /p?), because the photon and Z couple differently 

than gluons to GUT-heavy particles. Of course 7/Z couple differently than gluons 
to light particles, but at this point that difference is not from the coupling constant, 
which is the common 5 gut- The photon (or Z, or gluon) self-energy, p^Il{p^) has 
two contributions, roughly 


n(p2 


«GUT M^vt 


TT 




+ 5?™Iln7+0 

TT p^ 


p- 


M^ut 



3.4. BEYOND Clight- 
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The first term is from the GUT heavy particles while the second, which I call 11 
below, is from effectively massless particles. So, neglecting powers of p^/MQ^rp the 

jV^2 

new coupling has been shifted by In —_ More explicitly, if the tree-level (Born) 
amplitude is then the self-energy correction to the one-loop 

amplitude is 


9gvt ^4(0 ) ^ [ ^gut ] ( ^ ^ ^(0) 

p2(i + n) +Vp2(i + n)y 


(3.1) 


We have used the fact that the product of the GUT-heavy and massless particles 
contributions to IT is higher order in ogut/t’' and hence can be neglected to the 
order we are working. Now in (3.1) the first term effectively changes the value of 
the coupling constant, 

gl ^ )- alvT 

aCrUi » I OGUT Mgut 

' TT /X 


while the second term is the contribution of light particles to the self energy. Both 
terms depend on the renormalization scale, but in such a way that once we account 
for self-energies of the external states (electrons and muons) the amplitude is p- 
independent. We see that the amplitude could be computed from the effective 
Lagrangian, Tughtj which is just the Lagrangian of the SM provided the coupling 
constant used is g. We still have to account for the effect of self-energies on the 
external particles. This can also be broken into a contribution from GUT-heavy 
particles, that goes into the factor of Z2, and a contribution from light ones, and 
the latter is produced by Tught: 


7, ^ GUT 





«GUT ^GUt \ 
TT g J 



+ ■■■ 


Thus, we see, the EFT formalism is much like a factorization theorem, except that 
not really, because the elective couplings, g, appear everywhere. 


3.4 Beyond flight• 

Tiight is the most general renormalizable Lagrangian of the light fields. It may be 
less general if exact symmetries of the full Lagrangian, Tfuii, forbid some terms in 
the effective Lagrangian. But there are processes described by Tfuii involving only 
external light fields that may be absent from Tught- For example, 



















56 


CHAPTER 3. EFFECTIVE FIELD THEORY 



There are no terms in Tught one can write, consistent with gauge symmetry and 
renormalizability, that reproduce these. In fact, the SM Lagrangian respects baryon 
and lepton number separately while these processes require breaking of both sym¬ 
metries. To obtain this we must supplement the effective Lagrangian with addi¬ 
tional non-renormalizable terms. 


CeS - Aight + + • • • (3.2) 

where consists of a sum of operators of dimension n constructed out of light 
fields and coefficients that depend on the couplings gi. 

It is easy to see how this works at tree level: 




P/iPi^ /^gut a 
- M^ut / 




9gut 

M^ut 


JfT 


2 9/J,iy 


where stand for currents that couple to the GUT-heavy gauge bosons and in 
the last step we have used \p\ <C Mgut- So we have found a tree level contribution, 
£(6) = — 


3.5 Beyond Tree Level^ 

We have established that in GUT for some AB = AL ^ 0 4-pt functions. 


G^'^Kpi, ■ ■ ■ ,P4) = {T)&‘^\pi, ... ,P4) H- 


where the right hand side is computed in an effective theory with light fields only, 
with Lagrangian £eff- The same applies to the weak interactions if the exchange 
particle is a W/Z and the external states are quarks and leptons (except for the 
very heavy top quark) and the energies involved are all small compared to the W 
and Z masses. Diagrammatically 

► 

W 

< I ^ 



^Much of this section is based on Ref. [25] 
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We have seen this before in (2.15), albeit the argument there was somewhat infor¬ 
mal. Of course, the equation has corrections of relative order /M'^ (I will use M 
for the heavy mass from here on, which can be a W oi a Z mass). So we can make 
it formally correct by writing: 



Our task presently is to extend this argument beyond tree level. The 1-loop 
corrections we are concerned about are from gluon exchange among the quarks, 
since the strong coupling constant is the largest among the couplings in the SM. 
Moreover, the coupling constant grows larger the lower the energy and we want to 
use this technique for low energy processes, e.g., H-meson decays. The treatment 
of photon mediated loops is entirely analogous. We will discuss the effect of loops 
with W/Z's below. We want to show 



At least for now graphs on the right hand side (RHS) are in one-to-one corre- 
spondance with those on the LHS. So let’s compare one at a time. We ask hrst 




+ ^im) 


Keep in mind the first term on the RHS has an explicit factor of 1/M^. It is easy 
to see that this equation is non-sensical. The left hand side, sketchily, is 

This is indeed a sketch. We left out the interaction vertices and set all external 
momentum to zero. The point we are trying to make is simple. This integral is 
clearly UV convergent. As A: —)• oo the integrand of the 4-dimensional integral is 
~ 1/A:®. It’s a different story for the RHS, 
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The crucial difference (the only difference!) is that the Vl^-propagator has been 
replaced by a constant, which now gives, as k —>■ oo, an integrand ~ be., 

logarithmically divergent. The equation makes no sense, one side is a number the 
other is formally infinite. 

Oops. We need to renormalize. But before that we can see if the finite part 
of the infinite RHS has the correct dependence on external momenta, pi, up to 
some trivial pi dependence in the infinite part. By “correct dependence” I mean, 
of course, that it reproduces the pi dependence of the LHS. 

To this end take djdp^ on both sides of the would-be-equation (obviously now 
restoring the external momentum and, if need be, the small masses of the light 
fields). Now, when the external momentum pi appears in the propagator of some 
internal light-held inside the loop (an internal line), then the action of djdp^ on 
this propagator increases the degree of convergence. For example, for a scalar held, 

d 1 + p^^ 1 

-^ —2 - K —~ 

dpfi {k -f p)^ — 

Diagrammatically, 

V J 

where the vertex, indicated as a cross, 
other propagators. 


[{k + pY — 772,2] k 


k + p 




(3.3) 


tands for 2i{k + p)^. The same is true for 


Exercises 

Exercise 3.5-9: Show this. That is, that djdp^ on an internal quark or gluon prop¬ 
agator changes the loop momentum UV asymptotic scaling of the propagator from 
l/k and l/A:^ to l/fc^ and 1/fc^, respectively. Show that the result can be expressed 
diagrammatically just as in (3.3), with the appropriate choice for the vertex indicated 
as a cross. 

Returning to the question at hand, we take a derivative and ask whether the 
following holds: 



To see that this actually does hold, consider 
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For simplicity I have chosen the routing of the momentum through the loop so 
that the momentum of the W is the loop momentum k. I have indicated the rest 
of the factors in the diagram by “(• • •).” All we need to know about this factor 
is that (i) it is common to the two sides of the equation we are studying and 
(ii) it contains sufficient inverse powers of k to render them finite. Now, we are 
taking two limits. One is the explicit one, and the other one is in the limits of 
integration of the integral over momenta. We can ask whether the order of taking 
the limits matter. Well, limA^-s-oo ^ fd‘^kM‘^{- ■ ■) ^ converges uniformly, and 

so does ^d^kYmiM^ and by standard mathematical non-sense 

both converge to the same limit. But the latter is simply the result of the effective 
theory calculation since limM^oo (M^(- • •) ov, diagrammatically, 



As we saw earlier, this is what we mean formally by equality up to corrections that 
vanish as 1 /M of the full and effective (once-differentiated) graphs. 

Next, do this for each propagator on which djdpi acts and for each momenta: 
think of these as different graphs with one insertion of the “cross” vertex. Do it 
also for each non-vanishing light mass that may appear in loop propagators. The 
same argument goes through, graph by graph, in establishing the equality between 
full and effective theories.^ We conclude then that 



and similarly for light field mass derivatives. Integrating we get 


The difference C in the amputated Green functions, which we know is infinite, 
must vanish upon differentiation with respect to the external momenta or the light 
masses. C may depend on gg, Qw and M (really g 2 and M\y). And it must have 
the same chiral structure as the other terms, 7^(1 — 75) ® 7/^(1 — 75) which, up to 
a numerical factor is just the tree level amputated Green function in the effective 
theory. So writing 




^Maybe it’s worth emphasizing that we are only considering insertion of the “cross” in internal 
propagators. On external propagators the degree of divergence of the graph is not reduced. You 
can alternatively consider only amputated Green functions. 
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and noting that c is oder a-j/vr, we have 

g 

Note that c is infinite and this equation holds because it cancels the infinity in the 
1-loop diagram on the RHS. That is, it is the counter-term to that 1-loop diagram. 

This goes through when we include all 1-loop graphs that contribute to the 
amputated Green functions. So we obtain, at least at 1-loop, 

... ,P4) = ^T»f^^(pi,...,p4)H- 

where 

r = amputated Green function, renormalized 
r = idem in the EFT with Lagrangian Tught 
To = idem with a zero momentum insertion of the 4-quark operator O 
D = finite coefficient, of order a^/vr 
• • • = higher order in 1 /M^ 

I hope you recognize this is precisely the statement that Tgff of Eq. 3.2 gives the 
desired approximation to the Green function. 

Comments: 

1. D has an expansion, D = 1 + di^ + d 2 it may depend on M but 

not on Pi,..., p4 nor rrij. 

2. The dependence on pi,... ,p 4 and on nij of the full and EFT are the same. 
They have the same analytic structure (same cuts, poles, residues, and what 
not), provided the approximations we have made (such as \pi\ <C M) are 
valid. But the full and EFT may differ badly when the approximation fails, 
for example, for |pj| ~ M. 

3. The above result, we already stated, is summarized by Eq. 3.2, or more 
specifically, £efr = flight + tjt —DO -!-•••. It should be clear that one is to 

compute in the EFT to all orders in the interactions in Tught but only to hrst 

order in -^ 77 — DO. This is the statement that the Green function with one 

Mqut 

insertion of the operator is equal in the full and effective theories. So for may 
applications we find we need 

amplitude OC (V’finall'^intl'^initial) ^ ('^finallV^initial) 
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Many of the difficulties in obtaining good predictions in flavor physics arise 
from the need to calculate the matrix elements of operators. 

4. One has to analyze and consider separately the case with two (or more) 
virtual heavy particles, as is the case, for example, for the box diagram that 
gives rise to neutral meson mixing. 


3.6 RGE improvement 


Let’s inspect the function D = D(M,gs) more closely. It is dimensionless, so it 
depends on M only through the ratio M/fi, with /r the renormalization scale. Of 
course, D depends on /i but it is customary to leave this as implicit dependence. 
Now, the fact that amplitudes are /x-independent immediately gives us 

7 

Therefore, if 

= 7c)Cl then — ~loD. 

a/x dg. 

Slow down a little. 70 is called the anomalous dimension of the operator O. In 
a mass independent renormalization scheme, like dimensional regularization with 
MS (subtracting only the e = d — 4 poles), the anomalous dimension is only a 
function of the dimensionless couplings, in this case Qs. 

The amazing thing about this is that the whole M-dependence of the coefficient 
function D is fixed by the renormalization group! Let’s see how this works in some 
detail. In order to do so we solve the renormalization group equation (RGE). Since 
D = D{M/fi, Qs) the dependence on /x is either through the ratio of scales, or its 
logarithm, t = ln(/x/M), or implicitly through the coupling gg = 5s (/x). Then the 
RGE for D, Eq. 3.4, is explicitly 


We have used g for the strong coupling gg to streamline notation, and I3{g) is the 
beta function for the strong coupling constant. The solution to this equation is 
well known, but we review it here. We first introduce an auxiliary function, the 
running coupling constant g = g{t,g), defined as the solution to 


dgjt) 

dt 




with boundary condition 5(0, g) = g. 


Exercises 
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Exercise 3.6-10: Show that 


dg Pig) 


dg Pig)' 

Use this to show that the solution to the homogeneous RGE 


]Fit,g)=0 


dg 


is Fit,g) = figi—t,g)). That is, t and g can come into the arbitrary function / only 
in the combination gi—t,g). Make sure you get that sign in the argument of g right! 
Verify that the function / can be thought of as the boundary condition fig) = F(0, g). 

If you solved the previous exercise then it is not difficult to verify that the 
solution to the RGE for D is 


Dit,g) = DiO,gi-t,g))exp 
For example, at 1-loop one has 


4 


loig) = ai 


g^ 


IOtt^ 


and Pig) = -bo 


IGtt^ 


Then, to this order, 


dg _ , g^ 

dt IGvr^ 


1 


1 


where, as usual, a = g"^ jH:, and then 


bo 

-I_ Lf 

ait) Q'(O) 27r 


dg' ai ai ^ «(—t) 

n g' bo 2bo a(0) 


so that 


D(f,i,) = O(0,9(-f))(^ 


°1 

'26o 


There is some standard language that goes with this. We say we obtain the 
EFT by “integrating out” the heavy fields (or sometimes, the heavy “degrees of 


«(-*) 

5 ( 0 ) 


° is ob- 


freedom”) — in this case the W vector boson. The factor 
tained by “running” while the computation of the pre-factor Di0,gi—t)) is called 
“matching.” The matching computation can be performed by taking t = 0, as an 
expansion in the coupling constant at short distances. 


a 


-C*(0)fl'(0)) — 4?(0,5') — Dq+ • 


(3.6) 
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Notice that this is computed at t = 0, that is, at /U = M. This means that in the 
calculation one may encounter large logs of pi or rrij over p. But the coefficient 
function D is independent of these low energy variables. By taking t = 0 the 
matching calculation consists of computing functions of one variable only, namely 
a, and as indicated in (3.6) this can be done perturbatively. The point is that 
“matching” calculations ensure that the full and effective theories give the same 
answer, while “running” gives the full dependence on the renormalization scale p 
and, as we will see, re-sums logs. For example, the coefficient Dq is computed from 
comparing the leading order contribution to the amputated Green functions, as in 



and the next order coefficient, Di, is computed form comparing 1-loop graphs. 



where the ellipsis stand for all other one-gluon exchange 1-loop diagrams (on both 
sides of the equation). Here by Di we mean the finite part of the counter-term, 
well defined in any specific renormalization scheme, e.g., MS in dimensional regu¬ 
larization. 

Comments: 


1. From 


we have 


a{—t) 

d(0) 


°i 

'260 


q;(0) 

a{—t) 


1 


« 7 7 


M 






260 


a M 

1 -I- ui — In-h 

dvr p 


(3.7) 


The ellipsis stand for an infinite sum of terms of the form (^6o InM///)”. 
This is why we say the running gives a re-summation of the leading logs. 
Also, the n = 1 term, displayed explicitly, is precisely what determines the 
anomalous dimension 70 at 1-loop. Note, finally, that in the expansion we 
cannot say whether a is evaluated at M or at p, and we have therefore simply 
denoted it as a. The EFT method has allowed us to make sense of the scale 
at which a is evaluated. 


2. In D ('0finai|O|V’initiai) we have cleanly separated the physical scales. This is 
why we introduced the EFT to begin with. The coefficient D depends on 
In M/p, while the matrix element is independent of M and contains 
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In^u only through In^/E where E stands for any of the low energy scales 
associated with the physical process. This shows that we have to deal with 
large logs: either in D if we take fi E or in the matrix element if we 
take fi ~ M. We have explained above that we can compute D even in the 
presence of large \nM/E perturbatively, provided (5(0) is sufficiently small 
for perturbation theory to work. This is clearly the case for M = the mass 
of the WjZ vector bosons. The calculation of D at low n is valid provided 
the leading log expansion continues to make sense, so that a{—t) remains 
sufficiently small, although possibly very different form q;( 0). In practice n 
is commonly taken to be as low as 2 GeV (or sometimes even 1 GeV), and 
we let the non-perturbative method that determines the matrix element deal 
with the not so large In^/E. Even for E = Aqcd ~ 300 MeV, this is a 
reasonably small ratio of scales. 

3. Operator mixing; The discussion above has been very sketchy. In most cases 
we have to deal with more than one operator in the EFT Lagrangian at the 
time. A simple example is in the EFT for charm decay. Let’s take 


Ol = ClYsL dLl^lUL = 



(3.8) 


I have drawn the figure with the local vertex slightly separated as if there 
were two distinct vertices, when they are not. It is simply a device to indicate 
color flow (which also corresponds in this case to flow of the indices of the 
Dirac spinors). What I mean by this is that we have kept the color indices 
implicit. To be sure, the operator with explicit color indices indicated is Or = 
with the sum over i and j from 1 to W = 3 understood. 
This operator is the one you get from tree-level matching by integrating out 
the W vector-boson. Now, at 1-loop the graph 



requires a counter-term that, even without computing can be seen to be an 
operator 

O2 = clI^T^sl dLl^.T<^UL (3.9) 

where T“ are the generators of SU{Nc) and the quark (and r“) color-indices 
and the sum over a are implicit. This means that when we compute “running” 



3.6. RGE IMPROVEMENT 


65 


a term with O 2 will be generate in the EFT Lagrangian. More explicitly, we 
write 

£( 6 ) = ^ D^Oi 
i=l,2 

and the requirement that amplitudes be ^ independent is the same as 
^dDiOi/dn = 0 (sum on repeated i-indices understood from here on). Then 
the RGE for the operators is 


u—Oi = 'lijOj so that 
a// 




where 7 is now a 2 x 2 matrix function of Qs. The non-vanishing counter¬ 
term of the form O 2 gives 712 7^ 0, and the RGE then contains a term 
IJ,dD 2 /dfi oc Di which produces a non-vanishing D 2 {t) even if T>2(0) = 0. 

4. Tie some loose ends. In our discussion of K^-K^ mixing we postponed dis¬ 
cussion of how to handle the two loop graphs with gluons attaching to the 
internal IT-box diagram. Consider, for example, the first Feynman diagram 
in (2.18). We can think physically of what is going on here. When all the 
loop momenta is high (meaning, of order of M or larger) the whole diagram 
is effectively a point-like 4-quark interaction. This gives then a contribution 
to the 4-quark operator by matching, 



What about the case when the gluon in the loop carries low momentum? We 
can split the computation into two parts. First, compute the matching to 
the operator that results from cutting open the gluon line: 




This graph matches to an operator with 4-quarks and two gluons, which 
is a dimension-8 operator (at least). The evaluation of the matrix element 
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of this operator will include a contribution in which the gluons reconnect. 
At low energies this precisely reproduces the low energy part of the first 
diagram in (2.18). At high energies it gives again a local operator which is 
properly accounted for in the 4-quark operator if we have done the complete 
matching calculation correctly. Most importantly, since this is a dimension-8 
operator it appears in the EFT Lagrangian with a coefficient 1/M^, and 
is therefore suppressed by <C 1 relative to the contribution of the 

4-quark operator. Notice, incidentally, that I labeled the coefficient with a 
prime to make sure we distinguish this from the 4-quark operator. 


Exercises ^ 

Exercise 3.6-11: Consider the operator that occurs in K^-K mixing, namely, O = 
slJ^cIl SL^^dL- Argue, without an explicit computation, that it does not mix with 
other operators under renormalization, so that 70 is a 1 x 1 matrix. Find 70. Note: 
this is not an EFT calculation, but an exercise in perturbative QCD. You must com¬ 
pute the renormalized Green function with an insertion of the operator and then take 
the logarithmic /i derivative. If you have never seen this, you should first consult a text 
on Quantum Field Theory that covers this (for example, Peskin and Schroeder [29], 
section 12.4). 

Exercise 3.6-12: The operator in O 2 in (3.9) was introduced out of necessity: it 
was found to be a counter-termcounter-termcounter-termcounter-term to the operator 
produced by the weak interactions at tree level, Oi of (3.8). 

(i) Show that this set of operators closes under renormalization. That is, no new 
operators need be introduced to renormalize this pair. This justifies using a 2 x 2 
matrix of anomalous dimensions. You should be able to argue from symmetry 
and power counting, and therefore construct an all orders argument. Note: we 
are assuming we are working in a mass independent subtraction scheme, like MS 
in dimensional regularization, so that operators only mix with operators of the 
same dimension. 

(ii) Show that the set of operators Qi = Oi and Q 2 = clJ^ul dh^^SL is equivalent 
to the set 01,2 and give the explicit relation. We sometimes speak of these 
as different “basis” of operators (we can think of the space of operators as a 
linear vector space, since we can form linear combinations of them, and the 
computation of Green functions with one insertion respects linearity). 

Exercise 3.6-13: Our discussion of EFT used as a prototype the exchange of a heavy 
field (a W) to produce a dimension-greater-than-4 operator in the effective Lagrangian 
(a dimension-6 4-quark operator in the case of W exchange). For K^-K^ mixing, or 
more generally for any neutral meson mixing, the full process in the full theory is 
from a loop diagram containing two W propagators; see the “box” diagram in (2.14). 

(i) Consider the box diagram for fixed light internal quarks; for concreteness take 
both internal quark lines to be charm quarks. Use the methods of Sec. 3.5 to 
argue that even though the graph contains two W propagators, it produces only 
one factor of 1/Af^ in the coefficient of the EFT’s 4-quark operator. 



3.6. RGE IMPROVEMENT 


67 


(ii) Now include the other quarks. Work in the fictitious case that the top quark is 
much lighter than the W so that the old-fashion GIM mechanism applies; see 
Sec. 1.6.1. Show that now the coefficient of the 4-quark operator has a 1/M^. 
What makes up the dimensions so the EFT Lagrangian is still of dimension-4, 
as it must? 

Exercise 3.6-14: In the comments above we showed how to resolve the problem of 
light degrees of freedom inside loops. In particular we analyzed the first Feynman 
diagram in (2.18). 

(i) Analyze the other two diagrams in (2.18) and explain how to account for them 
in the EFT. 

(ii) The alert student may realize there are other light degrees of freedom in the 
loop: the light quarks! Consider for simplicity the box diagram (2.14) for the 
case that one internal quark is light and the other is the top quark. It makes 
sense to integrate out the top quark simultaneously with the W (we incur in 
errors from failing to distinguish a{Mw) from a (to*); we can live with that for 
now!). Analyze this case. What is the dimension of the operator that needs to 
be included to account for low momentum in the light quark in the loop? 
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